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The Applicants appeal the rejection of Claims 22-26 in the above-captioned patent 
application. These claims were rejected in a Final Office Action mailed April 25, 2005. 
Applicants filed a Notice of Appeal July 22, 2005. Applicants file concurrently herewith an 
Amendment After Final Office Action. 

I. REAL PARTY IN INTEREST 

Pursuant to 37 C.F.R. 41.37(c)(1), Appellants hereby notify the Board of Patent Appeals 
and Interferences that the real party in interest is the assignee of record for this application, 
Genentech, Inc., 1 DNA Way, South San Francisco, CA 94080. 

II. RELATED APPEALS AND INTERFERENCES 

A Notice of Appeal has been filed in the related Application No. 10/032996. Appellants 
are unaware of any other related appeals or interferences. 
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III. STATUS OF THE CLAIMS 

The above-captioned application was filed with Claims 1-21. Claims 1-21 were canceled 
and new Claims 22-27 were added in a Preliminary Amendment mailed December 27, 2001. 
Claim 27 was canceled in an Amendment and Response to Final Office Action dated December 
6, 2004. Claims 22-26 were finally rejected by the Examiner in a final Office Action mailed 
April 25, 2005. Appellants file concurrently herewith an Amendment After Final Office Action 
amending Claim 22. Accordingly, Claims 22-26 are the subject of this appeal. The claims 
attached hereto as Appendix A reflect the claims as amended by the Amendment After Final 
Office Action filed concurrently herewith. 

IV. STATUS OF AMENDMENTS 

Appellants mailed a "Request for Continued Examination" on March 7, 2005, submitting 
the claims with an amendment to Claim 22, and offering additional evidence in support of 
Appellants' arguments. In a Final Office Action mailed April 25, 2005, the Examiner indicated 
that the submission filed on March 9, 2005 had been entered. Appellants file concurrently 
herewith an Amendment After Final Office Action further amending Claim 22. 

V. SUMMARY OF THE CLAIMED SUBJECT MATTER 

The claimed subject matter relates to isolated antibodies which specifically bind to the 
polypeptide having SEQ ID NO:7. As amended, with the amended text shown underlined, 
independent Claim 22 reads: 

22. An isolated antibody that specifically binds to the polypeptide of SEQ ID NO:7. 

Various aspects of the claimed antibody are described in the specification at, for example, 
page 12, lines 8-9; page 31, lines 10-12; page 44, lines 13-18; page 46, line 25 through page 48, 
line 7; and page 90, line 20 through page 98, line 30. SEQ ID NO: 7 is disclosed in the Sequence 
Listing appended to the application. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

The Examiner has rejected Claims 22-26 under 35 U.S.C. §101, stating that the claimed 
invention is not supported by either a specific and substantial asserted utility or a well- 
established utility. 
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The Examiner has also rejected Claims 22-26 under 35 U.S.C. §112, first paragraph. The 
Examiner asserts "that it would require undue experimentation for one of skill in the art to 
perform the method of the claim as broadly written." 

VII. APPELLANTS 9 ARGUMENT 

A. Summary of Areuments 

1. Utility Rejection 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility." See Examination Guidelines, 66 Fed. Reg. 1092 (2001). 
Appellants have asserted that the claimed antibodies to the polypeptide of SEQ ID NO: 7 (the 
PR0539 polypeptide) are useful as diagnostic tools for cancer, particularly for lung and colon 
cancer. This asserted utility is specific, substantial, and credible. 

Briefly stated, Appellants' asserted utility is based on the disclosure in Example 16 of the 
instant application that the gene encoding the PR0539 polypeptide is amplified at least two-fold 
in the majority of the lung and colon tumors tested compared to normal tissue. It is well- 
established that gene amplification is correlated with an overexpression of the corresponding 
mRNA and the encoded polypeptide. Thus, one of skill in the art would be more likely than not 
to believe that the amplification of the PR0539 gene in a majority of lung and colon tumors 
leads to an increase in PR0539 mRNA and protein in these lung and colon tumors compared to 
their normal tissue counterpart. This differential expression of PR0539 mRNA and polypeptide 
is useful for distinguishing lung or colon tumor tissue from its normal tissue counterpart. 
Therefore, the claimed antibodies to the PR0539 polypeptide have a specific, substantial and 
credible utility as diagnostic tools for cancer, particularly lung and colon cancer, as is explained 
in more detail below. 

2. Enablement Rejection 

The second issue before the Board is whether Appellants have enabled the pending 

claims such that one of skill in the art would be able to make and use the claimed invention. The 

Examiner has rejected Claims 22-26 under 35 U.S.C. §112, first paragraph, arguing that the 

claimed subject matter was not described in the specification in such a way as to enable one 
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skilled in the art to make and/or use the invention. Office Action at 7. The Examiner recites the 
factors for determining enablement from In re Wands, 858 F.2d 731, 8 U.S.P.Q. 2d 1400 (Fed. 
Cir. 1988), relying to a large extent on the arguments made above in support of the rejection for 
lack of utility. 

Appellants submit that Claims 22-26 are enabled such that one of skill in the art could 
make and use the claimed antibodies without undue experimentation. It is well within those of 
skill in the art to make antibodies which are specific to a disclosed sequence, and the Examiner 
has not stated otherwise. See also In re Wands, 858 F.2d 731 (reversing the Board's decision of 
non-enablement and holding that as of 1980, undue experimentation was not required to make 
high-affinity monoclonal antibodies to a target peptide). 

Appellants assert that the claimed antibodies are useful as diagnostic tools for cancer, 
particularly lung and colon cancer based on the amplification of the PR0539 gene in lung and 
colon tumors. As detailed below, it is well-established that gene amplification is correlated with 
an overexpression of the encoded polypeptide, and thus it is likely that the PR0539 polypeptide 
is overexpressed in lung and colon cancer. Thus, based on the disclosure in the application, one 
of skill in the art would be able to use the claimed antibodies as diagnostic tools to detect 
overexpression of the PR0539 polypeptide in suspected lung and colon tumors without undue 
experimentation. 

B. Utility Rejection -Detailed Arguments 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility." See Examination Guidelines, 66 Fed. Reg. 1092 (2001). 
Appellants have asserted that the claimed antibodies to the polypeptide of SEQ ID NO: 7 (the 
PR0539 polypeptide) are useful as diagnostic tools for cancer, particularly for lung and colon 
cancer. This asserted utility is specific, substantial, and credible, as is explained in more detail 
below. 

1. Utility - Leml Standard 

A "specific utility" is defined as utility which is "specific to the subject matter claimed," 
in contrast to "a general utility that would be applicable to the broad class of the invention." See 
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M.P.E.P. § 2107.01 I. For example, it is generally not enough to state that a nucleic acid is 
useful as a diagnostic tool without also identifying the condition that is to be diagnosed. 

The requirement of "substantial utility" defines a "real world" use, and derives from the 
Supreme Court's holding in Brenner v. Manson, 383 U.S. 519, 534 (1966) stating that "[t]he 
basic quid pro quo contemplated by the Constitution and the Congress for granting a patent 
monopoly is the benefit derived by the public from an invention with substantial utility." In 
explaining the "substantial utility" standard, M.P.E.P. § 2107.01 cautions, however, that Office 
personnel must be careful not to interpret the phrase "immediate benefit to the public" or similar 
formulations used in certain court decisions to mean that products or services based on the 
claimed invention must be "currently available" to the public in order to satisfy the utility 
requirement. "Rather, any reasonable use that an applicant has identified for the invention that 
can be viewed as providing a public benefit should be accepted as sufficient, at least with regard 
to defining a 'substantial* utility." M.P.E.P. § 2107.01 (emphasis added). 

Indeed, the Guidelines for Examination of Applications for Compliance With the Utility 
Requirement, set forth in M.P.E.P. § 2107 11(B)(1) gives the following instruction to patent 
examiners: "If the applicant has asserted that the claimed invention is useful for any particular 
practical purpose . . . and the assertion would be considered credible by a person of ordinary skill 
in the art, do not impose a rejection based on lack of utility." 

Finally, in assessing the credibility of the asserted utility, the M.P.E.P. states that "to 
overcome the presumption of truth that an assertion of utility by the applicant enjoys" the PTO 
must establish that it is "more likely than not that one of ordinary skill in the art would doubt (i.e., 
'question') the truth of the statement of utility." M.P.E.P. § 2107.02 III A. 

2. Utility - Burden of Proof 

It is well established that a specification which contains a disclosure of utility which 
corresponds in scope to the subject matter sought to be patented "must be taken as sufficient to 
satisfy the utility requirement of § 101 for the entire claimed subject matter unless there is reason 
for one skilled in the art to question the objective truth of the statement of utility or its scope." In 
re Langer, 503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). Thus "the PTO has the 
initial burden of challenging a presumptively correct assertion of utility in the disclosure." In re 
Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). Only after the PTO provides 
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evidence showing that one of ordinary skill in the art would reasonably doubt the asserted utility 
does the burden shift to the applicant to provide rebuttal evidence sufficient to convince such a 
person of the invention's asserted utility. Id. 

3. Utility - Standard of Proof 

Compliance with 35 U.S.C. § 101 is a question of fact. Raytheon v. Roper, 724 F.2d 951, 

956, 220 U.S.P.Q. 592, 596 (Fed. Cir. 1983). The evidentiary standard to be used throughout ex 

parte examination in setting forth a rejection is a preponderance of the evidence, or "more likely 

than not" standard. In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 (Fed. Cir. 

1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P.E.P. § 2107.02, part VII (emphasis in original, 
citations omitted). 

The Court of Appeals for the Federal Circuit has stated that the standard for satisfying the 

utility requirement is a low one: 

The threshold of utility is not high : An invention is "useful" under section 101 if 
it is capable of providing some identifiable benefit. See Brenner v. Manson, 383 
U.S. 519, 534, 86 S.Ct. 1033, 16 L.Ed.2d 69 (1966); Brooktree Corp. v. Advanced 
Micro Devices, Inc., 977 F.2d 1555, 1571 (Fed. Cir. 1992) ("To violate § 101 the 
claimed device must be totally incapable of achieving a useful result"); Fuller v. 
Berger, 120 F. 274, 275 (7th Cir. 1903) (test for utility is whether invention "is 
incapable of serving any beneficial end"). Juicy Whip, Inc. v. Orange Bang, Inc., 
185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999) (emphasis added). 

The low threshold for satisfying the utility requirement is reflected in the standard set by the 
Federal Circuit for invalidating a patent based on a lack of utility: "[T]he fact that an invention 
has only limited utility and is only operable in certain applications is not grounds for finding lack 
of utility. Some degree of utility is sufficient for patentability. Further, the defense of non- 
utility cannot be sustained without proof of total incapacity ." Envirotech Corp. v. Al George, 
Inc., 730 F.2d 753, 762, 221 U.S.P.Q. 473 (Fed. Cir. 1984) (emphasis added, citations omitted). 

Because the standard for satisfying the utility requirement is so low, requiring total 
incapacity for a finding of no utility, the M.P.E.P. cautions that: 
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Rejections under 35 U.S.C. 101 have been rarely sustained by federal courts. 
Generally speaking, in these rare cases, the 35 U.S.C. 101 rejection was sustained 
[] because the applicant . . . asserted a utility that could only be true if it violated a 
scientific principle, such as the second law of thermodynamics, or a law of nature, 
or was wholly inconsistent with contemporary knowledge in the art. M.P.E.P. § 
2107.02 III B., citing In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 
(C.C.P.A. 1967) (underline emphasis in original, italic emphasis added). 

4. Appellants Asserted a Specific. Substantial and Credible Utility that is 
Sufficient to Satisfy the Utility Requirement ofS 101 

The claimed subject matter relates to antibodies which specifically bind to the 
polypeptide having SEQ ID NO: 7. The polypeptide of SEQ ID NO: 7 (referred to as "PR0539 
polypeptide") is encoded by the polynucleotide of SEQ ID NO: 6 (also referred to as 
DNA47465-1561). Specification at 35, lines 20-23. Appellants have asserted that the claimed 
antibodies are useful as diagnostic tools for cancer, particularly lung and colon cancer. 

In "Example 16: Gene Amplification" Appellants disclose that the gene encoding 
PR0539 polypeptide is amplified at least 2-fold in a majority of lung and colon tumors tested 
compared to normal tissue, respectively (a ? Ct value of 1 corresponds to a 2-fold increase, 2 ? Ct 
corresponds to 4-fold, 3 ?Ct corresponds to 8-fold, etc.). Specification at 111, line 35 through 
page 117. As explained on page 112, line 6 through page 113, line 31 of the specification, the 
amplification of the PR0539 gene was detected using the well-established technique of 
quantitative TaqMan PCR amplification of genomic DNA isolated from lung and colon tumors. 
See also, Declaration of Audrey Goddard f Ph.D. 

The specification teaches that amplification of a PRO polypeptide-encoding gene in one 

or more tumor tissues as compared normal tissue "is associated with overexpression of the gene 

product, indicating that the polypeptides are useful targets for therapeutic intervention... and 

diagnostic determination of the presence of those cancers." Specification at 112, lines 0-3. 

Likewise, Appellants disclose the use of antibodies to PRO polypeptides as diagnostic tools: 

[A]nti-PRO antibodies may be used in diagnostic assays for PRO, e.g., detecting 
its expression in specific cells, tissues, or serum. Various diagnostic assay 
techniques known in the art may be used, such as competitive binding assays, 
direct or indirect sandwich assays and immunoprecipitation assays conducted in 
either heterogeneous or homogeneous phases. Specification at 98, lines 6-9. 

Taken together, the specification clearly discloses the use of the claimed antibodies as 

diagnostic tools for cancer, particularly lung and colon cancer. This utility is substantial, as one 
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of skill in the art will recognize that the diagnosis of cancer is a "real world" use; it is specific, as 
the diagnosis of lung and colon cancer is not a utility that applies to the broad class of antibodies; 
and it is credible, as it is not a utility "that could only be true if it violated a scientific principle, 
...or a law of nature, or [is] wholly inconsistent with contemporary knowledge in the art." 
M.REP. § 2107.02 III B., citing In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 (C.C.P.A. 
1967). 

Because Appellants' specification contains a disclosure of utility which corresponds in 
scope to the claimed subject matter, antibodies which specifically bind to the polypeptide of SEQ 
ID NO: 7, the asserted utility "must be taken as sufficient to satisfy the utility requirement of § 
101 for the entire claimed subject matter unless there is reason for one skilled in the art to 
question the objective truth of the statement of utility or its scope." In re hanger, 503 F.2d 1380, 
1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). Therefore, the burden of establishing a prima 
facie case of lack of utility rests with the PTO. See, In re Brana, 51 F.3d 1560, 1566, 34 
U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("the PTO has the initial burden of challenging a 
presumptively correct assertion of utility in the disclosure"). 

5. The Examiner's Arguments 

To establish a prima facie showing that the claimed subject matter lacks utility, the 
Examiner must "provide[] evidence showing that one of ordinary skill in the art would 
reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 
(Fed. Cir. 1995). The Examiner has filed a first Office Action on April 19, 2004, a final Office 
Action on September 8, 2004, an Advisory Action on December 20, 2004, and a second final 
Office Action on April 25, 2005 after Appellants' Request for Continued Examination. None of 
these papers provide sufficient evidence that one of ordinary skill in the art would reasonably 
doubt the asserted utility. 

The final Office Action on April 25, 2005, contains all of the Examiner's arguments in 
support of the utility rejection made in the previous office actions. Appellants summarize these 
arguments below. 
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a. Appellants' evidence is allegedly deficient 

The Examiner has asserted that that the evidence of PR0539 gene amplification is 
deficient, stating: 

The level of overexpression of PRO 539 was minimal, not even a two fold 
overexpression in any cell type. Further, no statistical data was presented to show 
that the overexpression was significant in any way with no P-value or other 
statistical measure to demonstrate that, the overexpression was a real effect and 
not simply produced by chance. Office Action at 3 1 . 

b. The Examiner asserts that there is no "necessary" connection between 
DNA amplification. mRNA level and protein level 

The critical argument for the Examiner's rejection is that "the overexpression of the 
nucleic acid is not relevant to the utility of the protein and antibody." Office Action at 3. The 
Examiner cites Meric et al (Mol. Cancer Ther. (2002) 1:971-979) Gokman -Polar et al (Cancer 
Res. (2001) 61:1375-1381) and Pennica et al (Proc. Natl. Acad. Sci. USA (1998) 95:14717- 
14722) to support his assertion that "not only is there no necessary connection between the level 
of protein in a cell and the amount of mRNA, but there is also no necessary correlation between 
the amount of DNA in a cell and the amount of mRNA." Office Action at 4 (emphasis added). 
Therefore, the Examiner concludes that "any evidence by Applicant showing overexpression of 
one component does not provide utility for the protein itself." Id. at 4-5. 

c. The Examiner asserts that a protein's function cannot be predicted from 
its sequence 

The Examiner also argues that "given the breadth of these claims which encompass any 
antibody to the Pro-539 polypeptide, there is an abundance of evidence that very similar proteins 
can perform very different functions." Office Action at 5. The Examiner cites Rost et al. (J. Mol. 
Biol. (2002) 318(2):595-608) to support the assertion that "even high levels of homology do not 
necessarily correlate with actual protein function. In the current case, where the function of 
PRO-539 (SEQ ID NO: 7) is not known, the expectation is even lower that there is any utility 
that can be derived based upon the sequence." Id. 



1 Citation to "Office Action" refers to the final Office Action mailed April 25, 2005 
unless otherwise indicated. 
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d. The Examiner asserts that the instant case is similar to Examples 4 and 
12 of the Utility Guidelines 

The Examiner argues that the instant case is extremely similar to Example 12 of the 
Utility Guidelines where a protein which was known to be a receptor, but where the ligand was 
unknown, was found to lack utility. The Examiner argues that "[i]n the current case, the putative 
PRO-539 protein, lacks any substantial utility whatsoever, and solely relies upon an small level 
of mRNA overexpression in cancer cells. However, there is no necessary relationship between 
the protein levels or utilities and such an overexpression of the nucleic acid." Office Action at 5 
(emphasis added). The Examiner concludes that further research would be required to identify 
and reasonably confirm a "real world" context of use for PRO-539 antibodies. Without 
explanation, the Examiner also asserts that the instant case "directly tracks" Example 4 of the 
Utility Guidelines as well. Office Action at 6. 

e. The Examiner asserts that the claimed invention lacks a specific utility 
In addition to the arguments above that the claimed invention lacks a substantial utility, 

the Examiner also asserts that the claimed invention lacks a specific utility. The Examiner 
argues that "the antibody to the protein, as distinguished from the nucleic acid, has not been 
associated with any disease, any condition, or any other specific feature. There is no association 
of the antibody or protein with cancer or with any other disease." Office Action at 6. The 
Examiner states that "a general statement of diagnostic utility, such as diagnosing an unspecified 
disease, would ordinarily be insufficient absent a disclosure of what condition can be 
diagnosed." Id. 

6. The Examiner has not established a prima facie case that Claims 22-26 lack 
Utility 

The above arguments do not satisfy the Examiner's burden to "provide[] evidence 
showing that one of ordinary skill in the art would reasonably doubt the asserted utility." In re 
Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The Examiner has the burden 
of presenting "countervailing facts and reasoning sufficient to establish that a person of ordinary 
skill would not believe the applicant 's assertion of utility." M.P.E.P. §2107.02 IILA., citing in 
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re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("Only after the PTO 
provides evidence showing that one of ordinary skill in the art would reasonably doubt the 
asserted utility does the burden shift to the applicant to provide rebuttal evidence"). The 
Examiner's argument hinges on his assertion that there is no "necessary connection" or 
"necessary correlation" between gene amplification and mRNA or protein overexpression. The 
Examiner cites three references to support this assertion. 

The Examiner's arguments fail for two reasons. First, the lack of a "necessary," i.e., 
precise or perfect, connection is not sufficient to establish that one of skill in the art would 
reasonably doubt Appellants' asserted utility. Appellants do not need to establish the utility 
"beyond a reasonable doubt" or to a "statistical certainty." Instead, Appellants need show that 
one of ordinary skill in the art would conclude that the asserted utility is more likely than not true . 
See M.P.E.P. § 2107.02, part VII. Thus, the lack of a "necessary" connection does not mean that 
Appellants' asserted utility is not true under a more likely than not standard. For example, if the 
presence of A results in the presence of B in 9 out of 10 cases there is no "necessary" connection 
between A and B, yet one would believe that it is more likely than not true that given A, B will 
be present. 

The second reason the Examiner's arguments fail is that the references relied on by the 
Examiner are not contrary to Appellants' asserted utility, and in fact actually support Appellants' 
position. Because the references relied on by the Examiner offer little or no support for his 
assertions, the Examiner has failed to establish a prima facie showing that one of skill in the art 
would reasonably doubt the asserted utility, and the Board should accept Appellants' disclosed 
utility as sufficient. Appellants address each of the Examiner's arguments in turn. 

a. The data in Example 16 are sufficient to establish the asserted utility 
Appellants turn first to the Examiner's arguments challenging the reliability of the data 
reported in Example 16. The Examiner makes two arguments regarding the sufficiency of the 
data in Example 16: (1) he argues that the level of overexpression of PR0539 was minimal, "not 
even a two fold overexpression in any cell type;" and (2) that there is no statistical analysis to 
show that the data were significant. Office Action at 3. Appellants submit that the gene 
amplification data provided in Example 16 are sufficient to establish a specific and substantial 
utility for the PR0539 polypeptide and antibody. 
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As described in Example 16 of the present application, the inventors isolated genomic 
DNA from a variety of primary cancers and cancer cell lines. As a negative control, DNA was 
isolated from the blood of normal healthy individuals. Specification at 112, lines 7-9. Gene 
amplification was monitored using real-time quantitative TaqMan™ PCR. The gene 
amplification results are set forth in Table 7 (Table 8 as amended) on page 117. As explained in 
the specification on page 112, lines 17-19, the results of TaqMan™ PCR are reported in ACt 
units. It is well-known in the art that "Ct" stands for "threshold cycle." One Ct unit corresponds 
to one PCR cycle or approximately a 2-fold amplification, relative to control, 2 units correspond 
to 4-fold amplification, 3 units to 8-fold amplification, etc. Specification at 112, lines 17-19. 

Looking at the results reported in on page 117, nine primary lung tumors and eight 
primary colon tumors were tested, as well as a number of tumor cell lines. PR0539 had a ACt 
value of greater than 1, i.e. A more than two-fold amplification , in six of nine lung tumors and five 
of eight colon tumors. These data show that in more than half of the lung and colon tumors 
tested, the gene for PR0539 was amplified at least two-fold. Thus, the Examiner's statement 
that the overexpression of PR0539 was minimal, "not even a two fold overexpression in any cell 
type" is simply wrong. 

The Examiner has also rejected the data because he questions the statistical significance 

of the data. However, Appellants are not required to prove utility to a statistical certainty, only 

that it is more likely than not true. See Nelson v. Bowler, 626 F.2d 853, 856-57, 206 U.S.P.Q. 

881, 883-84 (C.C.P.A. 1980) (reversing the Board and rejecting an argument that evidence of 

utility was insufficient because it was not statistically significant). As the M.P.E.P. states: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a 
whole, it leads a person of ordinary skill in the art to conclude that the asserted 
utility is more likely than not true . M.P.E.P. § 2107.02, part VII (bold emphasis 
added, underline in original, citations omitted). 

Therefore, whether the results are statistically significant or not is irrelevant to 
establishing the asserted utility. The results must simply be reliable enough that one of skill in 
the art would believe that the utility is more likely than not true. 

In support of Appellants' assertion of utility, Appellants submitted a copy of the 

declaration of Dr. Audrey Goddard with exhibits A-G (the Goddard Declaration). As Dr. 
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Goddard's curriculum vitae, Exhibit A of the Goddard Declaration, shows, she is an expert in the 
art of identifying and quantifying the amplification of oncogenes in cancers. 
In her declaration, Dr. Goddard states that 

[T]he quantitative TaqMan PCR technique is technically sensitive enough to 
detect at least a 2-fold increase in gene copy number relative to control. It is 
further my considered scientific opinion that ... a gene identified as being 
amplified at least 2-fold by the quantitative TaqMan PCR assay in a tumor sample 
relative to a normal sample is useful as a marker for the diagnosis of cancer , for 
monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. Goddard Declaration at ^ 7 (emphasis added). 

Therefore, according to Dr. Goddard, an expert in the field, a 2-fold or more increase, i.e., 
a ACt value of 1 or greater, is significant and reliable enough to be useful for the diagnosis of 
cancer. The ACt value for PR0539 is greater than 1 for the majority of primary lung and colon 
tumors tested, indicating a more than two-fold amplification. Specification at page 117. 

The Examiner has not offered any substantial reason or evidence to reject either the 
underlying data or Dr. Goddard's conclusions. Instead, the Examiner has dismissed the Goddard 
declaration as "fundamentally flawed/' arguing the it fails to provide specific evidence regarding 
PR0539. Final Office Action mailed 9/8/04 at 11. While it is true that Dr. Goddard's 
Declaration discusses the significance of the data genetically without specific reference to 
PR0539, her declaration read in light of the data reported in Example 16 clearly make her 
declaration applicable to PR0539. 

Dr. Goddard is an expert in the field whose declaration is based on personal knowledge 
of the relevant facts at issue. Appellants' have reminded the Examiner that "Office personnel 
must accept an opinion from a qualified expert that is based upon relevant facts whose accuracy 
is not being questioned." M.P.E.P. § 2107 (emphasis added). In addition, declarations relating 
to issues of fact should not be summarily dismissed as "opinions" without an adequate 
explanation of how the declaration fails to rebut the Examiner's position. See in re Alton 76 F.3d 
1168 (Fed. Cir. 1996). Therefore, the Examiner should accept Dr. Goddard's statement that "a 
gene identified as being amplified at least 2-fold by the quantitative TaqMan PCR assay in a 
tumor sample relative to a normal sample is useful as a marker for the diagnosis of cancer." 

Interestingly, the above argument along with the Goddard Declaration were presented to 

Examiner Fredman in the closely related and co-owned patent application Serial No. 10/033,167 

to overcome a nearly identical 35 U.S.C. § 101 utility rejection of claims directed in part to the 
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nucleic acid sequence encoding the PR0539 protein. That application contains the same data 
presented and relied on in the instant application. In that case, the Examiner concluded that the 
Goddard declaration was "sufficient to overcome the rejection of the claims based upon utility 
and enablement as discussed above." Office Action mailed 9/9/2003, page 5. 

Thus, it is the uncontested opinion of an expert in the field that the results in Example 16 
are significant and reliable enough to indicate that the PROS 3 9 gene is useful as a diagnostic tool 
for cancer, and the PTO has accepted this utility in a related application. 

In conclusion, Appellants submit that the evidence reported in Example 16, supported by 
the Goddard Declaration, establish that the PR0539 gene is amplified at least two-fold in the 
majority of lung and colon tumors tested and can be used as a marker for the diagnosis of cancer. 
The Examiner has not offered any significant arguments or evidence to the contrary, and 
therefore has not established a prima facie case that one of skill in the art would reasonably 
doubt the significance of the data in Example 16, or the assertion that the PR0539 gene can be 
used as a diagnostic tool. For the reasons discussed below, based on these data Appellants' have 
established a specific and substantial utility for the PROS39 polypeptide and antibody as well. 

b. The Examiner's cited references are not contrary to Appellants 9 
assertion that sene amplification 2enerally leads to increased mRNA 
and protein levels 

Appellants next address the heart of the Examiner's rejection - his assertion that "not 
only is there no necessary connection between the level of protein in a cell and the amount of 
mRNA, but there is also no necessary correlation between the amount of DNA in a cell and the 
amount of mRNA. Therefore, any evidence by Applicant showing overexpression of one 
component does not provide utility for the protein itself ." Office Action at 4-5 (emphasis added). 

As an initial matter, Appellants emphasize that the level of proof required to establish 
utility is relatively low. "[T]he applicant does not have to provide evidence sufficient to 
establish that an asserted utility is true 'beyond a reasonable doubt.' Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of statistical certainty." 
M.P.E.P. § 2107.02, part VII (citations omitted). Therefore, there does not have to be a 
"necessary," precise or perfect correlation between gene amplification and increased mRNA and 



- 14- 



Appl. No. : 10/033,396 

FUed : December 27, 2001 



protein levels. Instead, the evidence must be sufficient to convince a skilled artisan that the 
asserted utility is "more likely than not true." Id. 

Thus, even if the references cited by the Examiner supported the conclusion that there is 
no "necessary" connection between gene amplification, mRNA, and protein levels, this is not the 
requisite standard which Appellants must meet to satisfy their burden. Instead, there need only 
be evidence that the asserted utility is "more likely than not true." As discussed above, it does 
not follow that one of skill in the art would not believe that the asserted utility is more likely than 
not true even //"the connection between gene amplification and the level of mRNA and protein is 
not "necessary." Stated another way, the correlation between gene amplification and the level of 
mRNA and protein does not have to be exact for the asserted utility to be more likely than not 
true. 

Keeping the standard of proof in mind, we turn to the three references the Examiner cites 
as support for his assertion that there is no "necessary" connection between gene amplification, 
the amount of mRNA and the amount of protein. The first is an article by Meric et al The 
Examiner states that "in a discussion of regulation of gene activity in cancer [Meric] notes that 
'Gene expression is quite complicated, however, and is also regulated at the level of mRNA 
stability, mRNA translation and protein stability (page 971, column 1).'" Office Action at 4. 
From this statement, the Examiner concludes that "Meric teaches that there is not necessarily a 
correlation between mRNA levels and protein levels in cancer cells, since the regulation may 
occur at levels other than that of the mRNA, such as in the level of translation of the mRNA or in 
the stability of the protein." Office Action at 3-4. 

What the Examiner ignores is the preceding statement by the authors: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the 
level of mRNA . which can be attributable to either DNA amplification or to 
differences in transcription. Meric at 971 (emphasis added). 

This statement supports Appellants' asserted utility. First, Meric teaches that although 
there is more than one mechanism, changes in mRNA in tumors cells can be attributed to gene 
amplification. This clearly supports Appellants' assertion that gene amplification leads to 
increase in mRNA level. 
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Second, Meric states that examining changes in mRNA levels is a "fundamental 
principle" of molecular cancer therapeutics. It is true that there is no necessary , i.e., absolute or 
perfect, correlation between gene expression and protein expression because there are other 
mechanism for regulating gene expression. However, exploiting differences in gene expression 
between cancer cells and normal cells would not be a "fundamental principle" of molecular 
cancer therapeutics if there were no significant correlation between gene expression and protein 
levels. Stated another way, changes in mRNA without corresponding changes in protein levels 
would have little or no effect on cellular biology, and those of skill in the art would have no 
reason to examine the differences in gene expression at the mRNA level without such a 
correlation. However, as one of skill in the art recognizes, there is a strong correlation between 
changes in mRNA and changes in protein level. It is because of this strong correlation that it 
remains a "fundamental principle" of molecular therapeutics in cancer to look at changes in 
mRNA level. 

Thus, while Meric does teach that there is no "necessary" correlation between changes in 
mRNA and changes in protein, it supports Appellants' assertion that generally there is a good 
correlation between the two. Appellants reiterate that they need not prove an asserted utility is 
true "beyond a reasonable doubt" or as a matter of "statistical certainty." The evidence must 
lead a person of ordinary skill in the art to conclude that the asserted utility is " more likely than 
not true ." M.P.E.P. § 2107.02, part VII. Because the correlation does not have to be absolute for 
one of skill in the art to believe the asserted utility is more likely than not true, Meric does not 
support the Examiner's rejection, and does not satisfy his burden of presenting "countervailing 
facts and reasoning sufficient to establish that a person of ordinary skill would not believe the 
applicant's assertion of utility." M.P.E.P. §2107.02 III.A., citing in re Brana, 51 F.3d 1560, 
1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). 

The Examiner's second citation is an article by Gokman-Polar. The Examiner asserts 
that the article "teaches 'Quantitative reverse transcript ion-PCR analysis revealed that PKC 
mRNA levels do not directly correlate with PKC protein levels, indicating that PKC isozyme 
expression is likely regulated at the posttranscriptional/translational level (see abstract).'" Office 
Action at 4. The Examiner states that Gokman -Polar shows in figures 6 and 7 that "there is no 
increase in mRNA expression for any of the isozymes, while the protein is significantly 
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overexpressed as shown by figures 4 and 5. This demonstrates that there is no relationship 
between mRNA levels and protein levels." Office Action at 4 (emphasis added). 

Appellants submit that like Meric, Gokman -Polar is not contrary to Appellants' assertion 
that gene amplification generally leads to overexpression of the mRNA and protein. Appellants 
emphasize that they are asserting that gene amplification leads to an increase in PR0539 mRNA, 
which results in increased PR0539 protein in the tumors where PR0539 is amplified. 
Appellants are not asserting that every change in protein level is a result of gene amplification or 
an increase in mRNA expression - Appellants acknowledge that there are other ways to increase 
protein levels. Therefore, the lack of an large increase in PKC BII mRNA accompanying an 
increase in PKC BII protein levels, cited by the Examiner as support for his rejection, is not very 
relevant to Appellants' asserted utility since it does not establish that increased mRNA does not 
lead to increased protein. Instead, it shows that not all protein increases are a result of increased 
mRNA. 

In addition, a close review of the entire Gokman -Polar article indicates that the trend in 
the data is that changes in mRNA levels are positively correlated with protein levels, supporting 
Appellants' assertion that the two are generally correlated. In Figure 6, the mRNA levels of two 
isozymes shows a decrease, while the third is increased slightly. This same pattern is seen for 
the corresponding protein levels in Figure 2, although as the Examiner points out, the increase in 
mRNA for the third isozyme is small compared to the increase in protein level. Similarly, 
comparing the mRNA levels of the three isozymes in Figure 7 to the corresponding protein 
levels in Figure 4, with one exception the changes in mRNA levels are positively correlated to 
changes in protein levels. While protein levels do not increase or decrease in direct proportion to 
the changes in mRNA, the trend in five of the six examples is that changes in mRNA levels are 
positively correlated to changes in protein levels. 

This evidence is hardly sufficient to establish that one of skill in the art would reasonably 
doubt that there is a reasonable correlation between changes in mRNA levels and changes in the 
corresponding protein levels. First, it is evidence taken from only two genes, one of which has 
two splice variants. This is hardly sufficient evidence to refute Applicants' assertion that in 
general, increases in mRNA lead to increases in the corresponding protein. Second, even within 
this small sample, five of the six data points show a positive correlation between changes in 
mRNA and protein levels - decreases in mRNA are associated with decreases in protein, while 
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increases in mRNA are associated with increases in protein. Therefore, Gokman -Polar does not 
satisfy the Examiner's burden to establish that one of skill in the art would reasonably doubt the 
asserted utility, and it definitely does not support the Examiner's obviously false statement that 
"there is no relationship between mRNA levels and protein levels." Office Action at 4 (emphasis 
added). 

Finally, the Examiner cites Pennica et al, stating that the article "shows that WISP-2 
DNA was amplified in cancer cells but was actually demonstrated REDUCED RNA expression 
(see abstract)." Office Action at 4 (emphasis in original). Based on this statement, the Examiner 
concludes that u [t]his provides additional evidence that there is no relationship between gene 
amplification and mRNA levels, since mRNA levels have no necessary correlation with gene 
amplification." Id. (emphasis added). 

The Examiner's conclusion that there is no relationship between gene amplification and 

mRNA levels is not supported by the reference. Pennica reports that: 

An analysis of WISP-1 gene amplification and expression in human colon tumors 
showed a correlation between DNA amplification and overexpression , whereas 
overexpression of WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, but its mRNA 
expression was significantly reduced in the majority of tumors compared with the 
expression in normal colonic mucosa from the same patient." Pennica at 14722, 
col. 1 (emphasis added). 

Importantly, the authors report that the WISP-1 gene was amplified approximately 2-fold, similar 
to the gene for PR0539 reported in Example 16. Id. at 14720, col. 2. 

Appellants first note that overexpression of WISP-3 RNA seen in the absence of DNA 
amplification is not relevant to the Appellants' asserted utility - Appellants are not asserting that 
amplification is the only source of gene overexpression. This leaves the results reported for 
WISP-1, where gene amplification (approximately two-fold) is correlated with overexpression of 
the mRNA, and WISP-2, where gene amplification was apparently not correlated with gene 
overexpression. 

The authors of Pennica offer an explanation for what they obviously viewed as an 
anomalous result for WISP-2: "Because the center of the 20ql3 amplicon [of which WISP-2 is a 
part] has not yet been identified, it is possible that the apparent amplification observed for 
WISP-2 may be caused by another gene in this amplicon." Pennica at 14722, col. 1 (emphasis 
added). Thus, the only example of a lack of positive correlation between gene amplification and 
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RNA overexpression relied on by the Examiner may be an artifact. The fact that the authors 
attempt to explain this anomaly only supports Appellants' argument that the accepted 
understanding in the art is that there is a positive correlation between gene amplification and 
overexpression of the mRNA. 

As stated above, the standard for utility is not absolute or even statistical certainty, but 
rather whether one of skill in the art would believe that the asserted utility is more likely than not 
true. One apparent contrary example, when combined with the positive example reported in the 
same reference, is not sufficient to prove that a person of skill in the art would have a reasonable 
doubt that gene amplification is not generally correlated with increased gene expression. The 
Examiner has not shown whether the apparent lack of correlation observed for one of the two 
amplified genes studied is typical, or is merely an exception to the rule of correlation. Indeed, 
the authors' attempt to explain the lack of correlation suggests that this result is the exception, 
and not the rule. 

In summary, Pennica provides one example of a positive correlation between gene 
amplification and increased gene expression, and one possible example of a lack of correlation. 
This single reference reporting a single gene that may show a lack of correlation between gene 
amplification and mRNA overexpression cannot support the Examiner's conclusion that there is 
"no relationship between gene amplification and mRNA levels" - the results of WISP-1 clearly 
show that in some cases (at least half of those relied on by the Examiner) there is a relationship. 
For the same reason, Appellants submit that Pennica is not sufficient to satisfy the Examiner's 
initial burden of establishing that it is more likely than not that one of skill in the art would doubt 
Appellants' assertion that generally, gene amplification leads to overexpression of the mRNA 
and protein. 

Taken together, the three references relied on by the Examiner are not sufficient to 
support his conclusion that "the overexpression of the nucleic acid is not relevant to the utility of 
the protein and antibody." Office Action at 3. First, Meric supports Appellants' assertion that 
generally, changes in mRNA lead to a corresponding change in the level of the encoded protein - 
that is why examining differences between tumor and normal tissue at the mRNA level is a 
"fundamental principle" of molecular cancer therapeutics. Likewise, Meric teaches that mRNA 
overexpression can be attributed to gene amplification. Second, Gokman-Polar reports a positive 
correlation between changes in mRNA level and changes in protein level for five of six samples 
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tested. Far from supporting the Examiner's conclusion that there is "no relationship" between 
mRNA and protein, this evidence tends to support Appellants' assertion that changes in mRNA 
and protein are correlated. Finally, the only reference which looked at a correlation between 
gene amplification and gene expression reports one gene where there was a strong correlation 
between the two, and one possible example where there was a lack of positive correlation. This 
evidence is at best inconclusive, with at least half the genes showing a correlation between gene 
amplification and mRNA overexpression. 

While these references may establish that there is no "necessary" correlation between 
gene amplification and overexpression of mRNA and protein, Appellants are not required to 
establish the asserted utility beyond a reasonable doubt or with a statistical certainty. Thus, even 
assuming that the Examiner has proved that there is no "necessary" correlation, this does not 
mean that he has met his initial burden of establishing that it is "more likely than not" that a 
skilled artisan would doubt the asserted utility. Given that two of the cited references actually 
tend to support the Appellants' position, Appellants assert that the Examiner has failed to meet 
his burden of establishing a prima facie case of lack of utility. 

c. The Examiner's arguments regarding the unpredictability of protein 
function from protein sequence are irrelevant 

Appellants next address the Examiner's argument that "there is an abundance of evidence 
that very similar proteins can perform very different functions" and that "even high levels of 
homology do not necessarily correlate with actual protein function. In the current case, where 
the function of PRO-539 (SEQ ID NO: 7) is not known, the expectation is even lower that there 
is any utility that can be derived based upon the sequence." Office Action at 5. 

Appellants' asserted utility does not rely on the function of the PR0539 protein, or any 
relation between the function of PR0539 and its sequence. The claimed subject matter relates to 
antibodies which specifically bind to the polypeptide of SEQ ID NO:7. The Appellants' asserted 
utility is the use of the claimed antibodies as diagnostic tools for cancer, based on the two-fold 
amplification of the PR0539 gene in a majority of the lung and colon tumors tested. This utility 
in no way depends on the function of the PR0539 protein, making the Examiner's arguments 
irrelevant. 
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d. The instant case is significantly different from Examples 4 and 12 of the 
Utility Guidelines 

The Examiner argues that the instant case is extremely similar to Example 12 of the 
Utility Guidelines where a protein which was known to be a receptor, but where the ligand was 
unknown, was found to lack utility. The Examiner argues that "[i]n the current case, the putative 
PRO-539 protein, lacks any substantial utility whatsoever, and solely relies upon an small level 
of mRNA overexpression in cancer cells. However, there is no necessary relationship between 
the protein levels or utilities and such an overexpression of the nucleic acid." Office Action at 5 
(emphasis added). The Examiner also asserts that the instant case "directly tracks" Example 4 of 
the Utility Guidelines as well. Office Action at 6. 

First, as stated above, Appellants are not required to establish their asserted utility 
"beyond a reasonable doubt" or to a "statistical certainty." Therefore, any lack of a " necessary" 
relationship between protein levels and overexpression of the nucleic acid does not establish that 
one of skill in the art is more likely than not to doubt the asserted utility. Second, for the reasons 
stated above, the Examiner's references do not establish that one of skill in the art would doubt 
Appellants' assertion that gene amplification generally leads to overexpression of the mRNA and 
protein. For this reason, the instant case is significantly different from Examples 4 and 12 of the 
Utility Guidelines. 

In Example 4, nothing more than the sequence of the protein was disclosed. There was 
no disclosed utility, or description of the chemical, physical or biological properties of the 
protein other than its sequence. Similarly, in Example 12, the specification discloses that an 
isolated protein is a receptor for an uncharacterized protein. The isolated protein is not 
characterized with respect to its biological function or any disease or body condition that is 
associated with the protein. 

In contrast to Examples 4 and 12, in the instant case the specification teaches that the 
gene encoding the PR0539 protein is amplified at least two-fold in the majority of lung and 
colon tumors tested. Appellants have asserted that one of skill in the art would recognize that it 
is likely that the PR0539 protein is therefore also overexpressed in a majority of lung and colon 
tumors, an assertion supported by expert declarations and references discussed below. This 
biological activity distinguishes the instant case from Examples 4 and 12. 
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Instead, the instant case is more like the caveat to Example 12, where the additional 
information that the isolated protein is found on the cell membranes of melanoma cells, but not 
normal skin cells is disclosed. The Utility Guidelines state that this additional information is 
sufficient to establish utility for antibodies to the protein as diagnostic tools. Similarly, the likely 
overexpression of PR0539 protein in lung and colon tumors is also sufficient to establish utility 
for the protein and antibodies. 

e. The asserted utility is specific 

Finally, Appellants address the Examiner's argument that "the antibody to the protein, as 
distinguished from the nucleic acid, has not been associated with any disease, any condition, or 
any other specific feature. There is no association of the antibody or protein with cancer or with 
any other disease." Office Action at 6. The Examiner states that "a general statement of 
diagnostic utility, such as diagnosing an unspecified disease, would ordinarily be insufficient 
absent a disclosure of what condition can be diagnosed." Id. 

Specific Utility is defined as utility which is "specific to the subject matter claimed," in 
contrast to "a general utility that would be applicable to the broad class of the invention." 
M.P.E.P. § 2107.01, part I (2004). Appellants submit that the evidence of amplification and 
overexpression of PR0539 nucleic acids (see below) in lung and colon tumors along with the 
declarations and references discussed below provide a specific utility for the claimed antibodies. 
As discussed in more detail below, Appellants have established a reasonable correlation between 
gene amplification and overexpression of the mRNA and protein. The Examiner has not 
provided any substantial evidence or arguments to the contrary. This makes antibodies to the 
PR0539 protein useful in diagnosing lung and colon cancer. This is not a "a general statement 
of diagnostic utility" that would apply to the broad class of antibodies. 

/ Conclusion - Examiner has failed to establish a prima facie case that 
one of skill in the art would doubt Appellants 9 asserted utility 
The Examiner has relied on essentially two unsupported arguments in rejecting the 
pending claims for lack of utility. First, the Examiner has questioned the sufficiency, reliability 
and significance of the data reported in Example 16. Second, the Examiner has argued that 
because there is no " necessary " connection between gene amplification and mRNA levels, and 
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no " necessary " connection between mRNA levels and protein levels, evidence of gene 
amplification does not provide utility for the protein. Appellants have responded to each of these 
arguments in turn. 

First, Appellants have established that the PR0539 gene is amplified at least two-fold in 
a majority of lung and colon tumors tested. The Goddard declaration establishes that the data in 
Example 16 are sufficiently reliable and significant to show that the PR0539 gene is useful as a 
cancer diagnostic tool. The Examiner has not provided any substantial reason or evidence for 
one of skill in the art to doubt the reliability or usefulness of the data in Example 16, or the facts 
and conclusions in the Goddard declaration. 

Second, Appellants have shown that the three references relied on by the Examiner as 
support for his assertion that gene amplification is not relevant to the utility of the protein and 
antibody do not support the Examiner's position, are not contrary to Appellants' assertion of a 
general correlation between gene amplification and mRNA and protein overexpression, and in 
fact lend support to Appellants' position. 

Finally, Appellants have adequately addressed the Examiners' remaining arguments 
regarding a lack of structure function-relationships in proteins; the similarity of the instant case 
to Examples 4 and 12 of the Utility Guidelines, and the asserted lack of a specific utility. 

Taken together, the Examiner's arguments are not sufficient to satisfy the Examiner's 
burden to "provide[] evidence showing that one of ordinary skill in the art would reasonably 
doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 
1995). The Examiner's arguments are largely conclusory statements which are not supported by 
any substantial evidence or reasoning which explains why one of ordinary skill in the art would 
reasonably doubt the asserted utility. And even the scant evidence that is provided supports the 
Appellants' position as much or more than the Examiner's position. Therefore, the Board should 
accept the Appellants' disclosure of utility. See Ex parte Rubin, 5 U.S.P.Q. 2d 1461 (Bd. Pat. 
App. & Interf. 1987) ("There is no factual support in this record for the examiner's questioning 
of the denaturation test reported in the specification. ... No reason to doubt 'the objective truth' 
of the asserted utility having been advanced by the examiner, we accept appellant's disclosure of 
utility corresponding in scope to the claimed subject matter."). 
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7. Appellants have provided Sufficient Rebuttal Evidence of Utility 

"Only after the PTO provides evidence showing that one of ordinary skill in the art would 

reasonably doubt the asserted utility does the burden shift to the applicant to provide rebuttal 

evidence." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The rebuttal 

evidence must be sufficient such that when it is considered as a whole, it is more likely than not 

that the asserted utility is true. See In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 

1444 (Fed. Cir. 1992) (stating that the evidentiary standard to be used throughout ex parte 

examination in setting forth a rejection is a preponderance of the evidence, or "more likely than 

not" standard). The M.P.E.P. summarizes the standard of proof required: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P.E.P. § 2107.02, part VII (emphasis in original, 
citations omitted). 

Appellants remind the Board that the Federal Circuit has stated that the standard for satisfying 
the utility requirement is a low one: "The threshold of utility is not high: An invention is 'useful' 
under section 101 if it is capable of providing some identifiable benefit." Juicy Whip, Inc. v. 
Orange Bang, Inc., 185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999). 

Even if the Examiner has satisfied his burden of presenting a prima facie case of lack of 
utility, Appellants have supplied more than enough rebuttal evidence, such that when considered 
as a whole, one of skill in the art would conclude that the asserted utility is more likely than not 
true. As discussed in detail below, Appellants have provided sufficient evidence that the gene 
encoding the PR0539 polypeptide is amplified and overexpressed in lung and colon tumors and 
can therefore be used as a diagnostic tool. In addition, Appellants have shown that it is well 
established in the art that there is a reasonable correlation between gene amplification, changes 
in mRNA level and changes in the corresponding protein level such that one of skill in the art 
would believe that the PR0539 polypeptide is also overexpressed in lung and colon cancers. 
Therefore, considering the evidence as a whole, one of skill in the art would believe that it is 
more likely than not that the claimed antibodies are useful as diagnostic tools for cancer, 
particularly lung and colon tumors. 
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cu Appellants have established that the PRQ539 zene is amplified at least 
two-fold in the majority of lunz and colon tumors tested 

As discussed above, the Examiner has not provided any evidence to challenge the 
reliability and significance of the data in Example 16 which reports that the PR0539 gene is 
amplified at least two-fold in a majority of lung and colon tumors tested compared to normal 
tissue. In contrast to this complete lack of evidence on the part of the Examiner, Appellants have 
submitted the Goddard Declaration. That declaration establishes that it is the opinion of an 
expert in the field who has personal knowledge of the facts surrounding Example 16 that a two- 
fold or greater amplification of a gene in tumors makes the gene useful as a diagnostic tool. The 
Examiner has not provided any evidence to challenge the facts and conclusions of the Goddard 
Declaration in support of Example 16. 

Given the disclosure of Example 16 and the supporting Goddard Declaration on the one 
hand, and the complete lack of any evidence or arguments on the other, it is clear that 
considering the evidence as a whole, one of skill in the art would conclude that it is more likely 
than not that the PR0539 gene is amplified at least two-fold in the majority of lung and colon 
tumors tested compared to normal tissue, and therefore the PR0539 gene is useful as a 
diagnostic tool for lung and colon cancer. 

As Appellants explain below, it is more likely than not that the PR0539 mRNA and 
polypeptide are overexpressed in lung and colon tumors, and can therefore be used to distinguish 
tumor tissue from normal tissue. This provides utility for the claimed antibodies to the PR0539 
polypeptide. 

b. Appellants have established that generally there is a correlation between 
sene amplification and mRNA overexpression 
Appellants turn next to the evidence offered in support of their assertion that it is well- 
established that gene amplification generally leads to overexpression of the corresponding 
mRNA. The teachings in Genes V, a leading textbook in the field, illustrate that at the time the 
instant application was filed, it was well known by those of skill in the art that gene amplification 
leads to overexpression of the corresponding gene product. See Benjamin Lewin, Genes V, 5 th ed. 
1994, pages 1196-1201. In a section entitled "Insertion, translocation, or amplification may 
activate proto-oncogenes", the text describes various molecular events that lead to 
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overexpression of a gene product, using the c-myc gene as an example. The first mechanism 
taught is insertion of a retrovirus upstream of the gene which causes it to be driven by a more 
efficient promoter, resulting in increased mRNA and protein levels. Next, Lewin teaches that 
chromosomal translocations may bring a gene to a new region where it is actively expressed, 
resulting in increased gene and protein expression. The third mechanism whereby protein levels 
of oncogenes are overexpressed is gene amplification. The text emphasizes that the common 
thread among the different means of activation of proto-oncogenes is that the expression of the 
gene is increased . Thus, as of 1994, it was well-known in the art that gene amplification is 
correlated with overexpression of the corresponding mRNA and encoded protein. 

Additional information regarding the understanding of those of skill in the art regarding 
the relationship between gene amplification and protein overexpression at the time the instant 
application was filed is found in Alitalo (Med. BioL, (1984) 62:304-317), and Merlino et al. (J. 
Clin. Invest., (1985) 75:1077-1079). Under the heading "Enhanced Expression of Amplified 
Oncogenes," Alitalo states that "[i]n all cases where they have been studied, the amplified 
oncogenes have been found abundantly expressed at the mRNA level roughly in proportion to 
the amount of DNA amplification (see Table 1)." Alitalo at 313 (emphasis added). Table 1 lists 
eleven examples of amplified oncogenes where expression levels were examined. In all eleven 
cases, expression of the amplified oncogene was elevated . Thus, Alitalo clearly teaches that 
amplification leads to overexpression. Merlino et al studied epidermoid carcinoma cells, and 
teach that amplification of the EGF receptor gene results in increased levels of EGF receptor 
mRNA and increased levels of EGF receptor protein. Taken together, the excerpt from Genes V, 
as well as the Alitalo and Merlino references, establish that as of the filing date of the instant 
application, those of skill in the art appreciated the correlation between gene amplification and 
overexpression of the encoded gene product. 

The teachings of Genes V, Alitalo, and Merlino are confirmed in several more recent 
reports that also document the correlation between gene amplification and levels of protein. 
Appellants have submitted two more recent studies providing evidence that the teachings 
referred to above are still widely accepted by those of skill in the art. Orntoft et al. (Molecular 
and Cellular Proteomics, (2002) 1:37-45), studied transcript levels of 5600 genes in malignant 
bladder cancers which were linked to a gain/loss of chromosomal material using an array-based 
method. Orntoft et al showed that there was a gene dosage effect and teach that "in general (18 
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of 23 cases) chromosomal areas with more than 2-fold gain of DNA showed a corresponding 
increase in mRNA transcripts." Orntoft at 37, column 1, abstract. In addition, Hyman et al 
(Cancer Research, (2002) 62:6240-6245) used CGH analysis and cDNA microarrays to compare 
DNA copy numbers and mRNA expression of over 12,000 genes in breast cancer tumors and cell 
lines. They showed that there is "evidence of a prominent global influence of copy number 
changes on gene expression levels." Hyman at 6244, column 1, last paragraph. 

Additional supportive teachings are also provided by Pollack et aL (PNAS, (2002) 
99:12963-12968) who studied a series of primary human breast tumors and found that "[b]y 
analyzing mRNA levels in parallel, we have also discovered that changes in DNA copy number 
have a large, pervasive, direct effect on global gene expression patterns in both breast cancer cell 
lines and tumors." Pollack at 12967 at column 1, (emphasis added). Their study found that 
"62% of highly amplified genes show moderately or highly elevated expression, that DNA copy 
number influences gene expression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, a 2-fold change in DNA 
copy number is associated with a corresponding 1.5-fold change in mRNA levels." Pollack at 
12963, column 1, abstract. 

Appellants have also submitted two references which show that gene amplification leads 
to overexpression of the encoded protein. Bahnassy et al (BMC Gastroenterology, (2004) 4:22- 
34), studied the amplification of cyclin Dl and assessed the levels of the encoded protein by 
immunohistochemistry. Bahnassy et al found a "significant correlation between cyclin Dl gene 
amplification and protein overexpression." Bahnassy at 27, column 1. Similarly, Blancato et al 
(British Journal of Cancer, (2004) 90(8), 1612-1619), report that overexpression of c~myc mRNA 
and c-Myc protein is related to the copy number of the c-myc amplification. Blancato at 1613, 
column 2. Bahnassy and Blancato demonstrate continued evidentiary support for the widely- 
accepted principle that gene amplification correlates with overexpression of the encoded protein. 
Presumably, amplification leads to overexpression of the encoded protein by increasing the 
amount of corresponding mRNA. 

Together, these excerpts and articles collectively teach that it is more likely than not that 
gene amplification increases mRNA expression. This evidence establishes that there is a 
reasonable correlation between gene amplification and gene overexpression, and one of skill in 
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the art would believe, to a reasonable probability, that gene amplification would lead to increased 
gene expression. 

This conclusion is supported by the declaration of Victoria Smith, Ph.D., an expert in the 
field of Molecular Biology. Exhibit B of her Declaration reports the results of microarray 
analysis conducted on the cDNA of the gene encoding PR0539 (DNA47465). The results 
indicate that the PR0539 mRNA is significantly overexpressed in eight of the twenty-six lung 
tumor samples tested compared to the normal lung tissue controls. That is the equivalent of 
nearly one in every three samples (31%). In addition, four out of five squamous cell lung 
carcinomas (80%) are significantly overexpressed (shown in bold). In contrast, only one of the 
seven individual normal lung tissue samples shows significant overexpression of the PR0539 
gene (14%). The tumors analyzed in the microarray are not the same tumors reported in 
Example 16, and therefore it is not possible to directly relate gene amplification to mRNA 
overexpression. However, these data do support the conclusion that PR0539 mRNA is 
overexpressed in lung tumors, particularly squamous cell lung carcinomas. 

Dr. Smith states that "[i]t is well-established in the art that overexpression of the mRNA 
for a gene is likely to lead to overexpression of the corresponding protein." Smith Declaration at 
If 6. She explains: 

Given the known correlation between overexpression of a gene and the 
corresponding overexpression of the encoded protein, it is very likely that a 
similar number of lung tumors will overexpress the PR0539 protein, while very 
few normal lung tissue samples likely will. Together with the data reported in 
Example 16 that the gene encoding PR0539 is amplified in some lung tumors, 
including squamous cell lung carcinoma, the results reported in Exhibit B indicate 
that the PR0539 gene and protein, as well as antibodies to the encoded protein, 
can be used to differentiate some cancerous lung tissue, particularly squamous 
cell carcinoma, from normal lung tissue. Smith Declaration at f 7 (emphasis in 
original). 

Because not all lung tumors show overexpression of PR0539, it cannot be used to 
exclude a sample being tested as non-cancerous. However, Dr. Smith states that the PR0539 
gene, protein, and corresponding antibodies are useful as a diagnostic tool for lung cancer, 
particularly squamous cell carcinoma, since a very high percentage of squamous cell lung 
carcinomas overexpress the gene and most likely the encoded protein, while very few normal 
lung samples do. 
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As discussed above, the Examiner relies on a single apparently contrary example of one 
gene reported by Pennica et al to support the assertion that there is "no relationship between 
gene amplification and mRNA levels, since mRNA levels have no necessary correlation with 
gene amplification." Office Action at 4 (emphasis added). As Appellants discussed above, this 
possible contrary example is balanced by the report in Pennica that the amplification of the 
WISP-1 gene by about two-fold led to overexpression of the corresponding mRNA. Thus, 
Pennica is at best inconclusive with one example of a positive correlation and one possible 
contrary example. If the example of WISP-2 relied on by the Examiner is an artifact as the 
authors suggest is possible, the Pennica reference actually supports Appellants' position. 

As stated above, the standard for utility is not absolute or even statistical certainty, but 
rather whether considering the evidence as a whole, one of skill in the art would be more likely 
than not to believe the asserted utility. Even if Pennica supported the PTO's argument, which it 
does not, one contrary example is not sufficient to prove that a person of skill in the art would 
have a reasonable doubt that gene amplification is not correlated to gene expression. Given the 
evidence provided by the Appellants which establishes that there is a correlation between gene 
amplification and mRNA expression, one of skill in the art would believe, to a reasonable 
probability, that the reported amplification of the PR0539 gene would lead to an increase in the 
level ofPR0539mRNA. 

c. Appellants have established that generally there is a correlation between 
changes in mRNA expression levels and changes in the expression level 
of the encoded protein 
Appellants turn to the next portion of their argument in support of their asserted utility - 
that it is well-established in the art that in most cases a change in the level of mRNA for a 
particular protein leads to a corresponding change in the level of the encoded protein. Given 
Appellants' evidence of two-fold amplification and overexpression of the PR0539 gene in 
certain lung and colon tumors, it is more likely than not that the PR0539 polypeptide is likewise 
overexpressed, and therefore the claimed antibodies are useful as diagnostic tools, particularly 
for lung and colon tumor. 

In support of the assertion that changes in mRNA are positively correlated to changes in 
protein levels, Appellants submitted a Declaration by J. Christopher Grimaldi, an expert in the 
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field of cancer biology. As stated in paragraph 5 of the declaration, "Those who work in this 

field are well aware that in the vast majority of cases, when a gene is over-expressed... the gene 

product or polypeptide will also be over-expressed.... This same principal applies to gene 

under-expression." Grimaldi Declaration at If 5. Further, "increased mRNA expression is 

expected to result in increased polypeptide expression, and the detection of decreased mRNA 

expression is expected to result in decreased polypeptide expression." Id. 

Appellants also submitted the declaration of Paul Polakis, Ph.D. an expert in the field of 

cancer biology. As stated in paragraph 6 of his declaration: 

Based on my own experience accumulated in more than 20 years of research, 
including the data discussed in paragraphs 4 and 5 above [showing a positive 
correlation between mRNA levels and encoded protein levels in the vast majority 
of cases studied in relation to the present invention] and my knowledge of the 
relevant scientific literature, it is my considered scientific opinion that for human 
genes, an increased level of mRNA in a tumor cell relative to a normal cell 
typically correlates to a similar increase in abundance of the encoded protein in 
the tumor cell relative to the normal cell. In fact, it remains a central dogma in 
molecular biology that increased mRNA levels are predictive of corresponding 
increased levels of the encoded protein. Polakis Declaration at U 6 (emphasis 
added). 

Dr. Polakis acknowledges that there are published cases where such a correlation does not exist, 
but states that it is his opinion, based on over 20 years of scientific research, that "such reports 
are exceptions to the commonly understood general rule that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." Polakis Declaration at 6. 

As mentioned above, the Smith Declaration also states that "[i]t is well-established in the 
art that overexpression of the mRNA for a gene is likely to lead to overexpression of the 
corresponding protein." Smith Declaration at ^ 6. She explains that "[g]iven the known 
correlation between overexpression of a gene and the corresponding overexpression of the 
encoded protein" it is very likely that those lung tumors that overexpress the PR0539 mRNA 
will also overexpress the PR0539 protein, and this indicates "that the PR0539 gene and protein, 
as well as antibodies to the encoded protein, can be used to differentiate some cancerous lung 
tissue, particularly squamous cell carcinoma, from normal lung tissue." Smith Declaration at 7. 

The Declarations of Grimaldi, Polakis and Smith are supported by the teachings in 
Molecular Biology of the Cell, a leading textbook in the field (Bruce Alberts, et al., Molecular 
Biology of the Cell (3 rd ed. 1994) (hereinafter "Cell 3 rd ") and (4 th ed. 2002) (hereinafter "Cell 
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4 )). Figure 9-2 of Cell 3 r shows the steps at which eukaryotic gene expression can be 
controlled. The first step depicted is transcriptional control. Cell 3 rd provides that "[f]or most 
genes transcriptional controls are paramount. This makes sense because, of all the possible 
control points illustrated in Figure 9-2, only transcriptional control ensures that no superfluous 
intermediates are synthesized." Cell 3 rd at 403 (emphasis added). In addition, the text states that 
"Although controls on the initiation of gene transcription are the predominant form of regulation 
for most genes , other controls can act later in the pathway from RNA to protein to modulate the 
amount of gene product that is made." Cell 3 rd at 453 (emphasis added). Thus, as established in 
Cell 3 rd , the predominant mechanism for regulating the amount of protein produced is by 
regulating transcription. 

In Cell 4 th , Figure 6-3 on page 302 illustrates the basic principle that there is a correlation 
between increased gene expression and increased protein expression. The accompanying text 
states that "a cell can change (or regulate) the expression of each of its genes according to the 
needs of the moment - most obviously by controlling the production of its mRNA ." Cell 4 th at 
302 (emphasis added). Similarly, Figure 6-90 on page 364 of Cell 4 th illustrates the path from 
gene to protein. The accompanying text states that while potentially each step can be regulated 
by the cell, " the initiation of transcription is the most common point for a cell to regulate the 
expression of each of its genes ." Cell 4 th at 364 (emphasis added). This point is repeated on 
page 379, where the authors state that of all the possible points for regulating protein expression, 

th 

" [f]or most genes transcriptional controls are paramount ." Cell 4 at 379 (emphasis added). 

Further support for Appellants' position can be found in the textbook, Genes VI, 
(Benjamin Lewin, Genes VI (1997)) which states "having acknowledged that control of gene 
expression can occur at multiple stages, and that production of RNA cannot inevitably be 
equated with production of protein, it is clear that the overwhelming majority of regulatory 
events occur at the initiation of transcription ." Genes VI at 847-848 (emphasis added). 

Additional support is also found in Zhigang et aL, World Journal of Surgical Oncology 
2:13, 2004. Zhigang studied the expression of prostate stem cell antigen (PSCA) protein and 
mRNA to validate it as a potential molecular target for diagnosis and treatment of human 
prostate cancer. The data showed "a high degree of correlation between PSCA protein and 
mRNA expression" Zhigang at 4. Of the samples tested, 81 out of 87 showed a high degree of 
correlation between mRNA expression and protein expression. The authors conclude that "it is 
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demonstrated that PSCA protein and mRNA overexpressed in human prostate cancer, and that 

the increased protein level of PSCA was resulted from the upregulated transcription of its 

mRNA." Id. at 6. Even though the correlation between mRNA expression and protein 

expression occurred in 93% of the samples tested, not 100%, the authors state that "PSCA may 

be a promising molecular marker for the clinical prognosis of human Pea and a valuable target 

for diagnosis and therapy of this tumor." Id. at 7. 

Further, as noted above, Meric et al, Molecular Cancer Therapeutics, vol. 1, 971-979 

(2002), states the following: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression at the 
level of mRNA, which can be attributable to either DNA amplification or to 
differences in transcription. Meric et al at 971 (emphasis added). 

Those of skill in the art would not be focusing on differences in gene expression between cancer 
cells and normal cells if there were no correlation between gene expression and protein 
expression. 

Together, the declarations of Grimaldi, Polakis, and Smith, the accompanying references, 
and the excerpts and references discussed above all establish that the accepted understanding in 
the art is that there is a reasonable correlation between changes in gene expression and changes 
in the level of the encoded protein. In contrast to this substantial amount of evidence supporting 
Appellants' position, the Examiner has two references, Meric et al and Gokman-Polar et al. 
However, as discussed above, Meric and Gokman-Polar actually support Appellants' position, 
not the Examiner's. It is clear that when considered as a whole, the preponderance of the 
evidence clearly weighs in favor of Appellants. 

d. The Examiner has failed to consider all of the evidence of record as a 
whole 

Even assuming that the Examiner has established a prima facie case that one of skill in 
the art would doubt the truth of the asserted utility, Appellants have presented sufficient rebuttal 
evidence. The rebuttal evidence "will be sufficient if, considered as a whole , it leads a person of 
ordinary skill in the art to conclude that the asserted utility is more likely than not true . M.P.E.P. 
§ 2107.02, part VII (emphasis in original, citations omitted). 
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The Examiner has not considered the evidence as a whole. Instead, the Examiner has 
summarily stated that the Grimaldi and Polakis Declarations are "fundamentally flawed because 
they fail to provide specific evidence regarding Pro-539." Office Action mailed 9/8/04 at 1 1. In 
response to the Declaration of Dr. Smith which has specific evidence regarding PR0539, the 
Examiner is silent, completely ignoring her declaration. In response to the extensive references 
cited by the Appellants, the Examiner has summarily dismissed them as "nonspecific arguments" 
that do not relate to PR0539. Office Action at 16. He argues that "[n]one of the references 
demonstrate that there is a 'reasonable probability' that the Pro-539 protein is overexpressed or 
that antibodies to the Pro-539 protein itself have any utility." Id. Instead of weighing the 
evidence as a whole as required, the Examiner summarily states that "[a]s noted in the rejection, 
there are other articles which demonstrate that there is no necessary relationship [between 
nucleic acid overexpression and protein overexpression] for every protein." Id. (emphasis 
added). 

Appellants have established that the PR0539 gene is amplified at least two-fold in the 
majority of lung and colon tumors tested. Appellants have submitted substantial evidence in the 
form of additional data, expert declarations, reference articles and textbooks, to support their 
assertion that it is more likely than not that this gene amplification leads to overexpression of the 
PR0539 mRNA and polypeptide. This makes the claimed antibodies to PR0539 polypeptide 
useful for diagnosing cancer, particularly lung and colon tumors. The Examiner has provided 
almost no evidence to the contrary. Given the overwhelming amount of evidence in support of 
Appellants' position, and the near absence of any evidence in support of the Examiner's position, 
when considered as a whole the evidence leads a person of ordinary skill in the art to conclude 
that the asserted utility is more likely than not true. 

8. The Courts have held that the Utility Requirement was Satisfied in Similar 
Cases 

The seminal decision interpreting the utility requirement of 35 U.S.C. § 101 is Brenner v. 
Manson, 383 U.S. 519, 148 U.S.P.Q. 689 (1966). At issue in Brenner was a claim to "a 
chemical process which yields an already known product whose utility - other than as a possible 
object of scientific inquiry - ha[d] not yet been evidenced." Id, at 529, 148 U.S.P.Q. at 693. The 
Patent Office rejected the claimed process for lack of utility because the product produced by the 
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claimed process had no known use. See id. at 521-22, 148 U.S.P.Q. at 690. On appeal, the Court 
of Customs and Patent Appeals reversed, holding "where a claimed process produces a known 
product it is not necessary to show utility for the product." Id, at 522, 148 U.S.P.Q. at 691 . 

In reviewing the lower court's decision, the Court made its oft quoted statement that 
u [t]he basic quid pro quo contemplated by the Constitution and the Congress for granting a 
patent monopoly is the benefit derived by the public from an invention with substantial utility. 
Unless and until a process is refined and developed to this point - where specific benefit exists in 
currently available form - there is insufficient justification for permitting an applicant to engross 
what may prove to be a broad field." Id. at 534-35, 148 U.S.P.Q. at 695. 

The first opinion of the C.C.P.A. applying Brenner was In re Kirk, 376 F.2d 936, 153 
U.S.P.Q. 48 (C.C.P.A. 1967). The invention claimed in Kirk was a set of steroid derivatives said 
to have valuable biological properties and to be of value "in the furtherance of steroidal research 
and in the application of steroidal materials to veterinary or medical practice." Id. at 938, 153 
U.S.P.Q. at 50. In affirming the claim rejection based on a lack of utility, the court held that the 
"nebulous expressions 'biological activity' or 'biological properties'" did not adequately convey 
how to use the claimed compounds. Id. at 941, 153 U.S.P.Q. at 52. The court also rejected 
appellants' supporting affidavit, stating, "the sum and substance of the affidavit appears to be 
that one of ordinary skill in the art would know 'how to use' the compounds to find out in the 
first instance whether the compounds are - or are not - in fact useful or possess useful properties, 
and to ascertain what those properties are." Id. at 942, 153 U.S.P.Q. at 53. 

The instant case is clearly distinguishable from Kirk, In Kirk, the asserted utility for the 
claimed compounds was "a new class of compounds often possessing high biological activity" 
and "intermediates in the preparation of compounds with valuable biological properties...". Id. 
at 1120, 1121. Here, Appellants have asserted a much more specific utility than "biological 
activity" or "biological properties." Appellants have provided evidence of amplification and 
overexpression of the PR0539 gene in certain cancers and have shown that this evidence is 
reasonably correlated to overexpression of the PR0539 polypeptide in those same cancers, 
namely, lung and colon cancer. As Example 12 of the Utility Guidelines make clear, when a 
protein is differentially expressed in cancer compared to normal tissue, the protein and antibodies 
have utility in diagnosing the cancer. 
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Since the Brenner and Kirk decisions, the courts have continued to clarify what is 
sufficient to satisfy the utility requirement. Three more recent decisions are of particular 
relevance to the instant application: Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. 881 (C.C.P.A. 
1980), Cross v. Iizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 (Fed. Cir. 1985), and Fujikawa v. 
Wattanasin, 93 F.3d 1559, 39 U.S.P.Q. 2d 1895 (Fed. Cir. 1996). 

The earliest of these cases, Nelson v. Bowler, involved an interference between two 
applications related to derivatives of naturally occurring prostaglandins (PG). Nelson, 626 F.2d 
at 854-55. The issue was whether Nelson had shown at least one utility for the compounds at 
issue to establish an actual reduction to practice. Id. at 855. The Appellants relied on two tests 
to prove practical utility: an in vivo rat blood pressure (BP) test and an in vitro gerbil colon 
smooth muscle stimulation (GC-SMS) test. In the BP test, the blood pressure of anesthetized 
rats was recorded on a polygraph chart to determine whether an injected compound had any 
effect. Responses were categorized as either a depressor (lowering) effect or a pressor 
(elevating) effect. Id. In the GC-SMS test a section of colon was excised from a freshly-killed 
gerbil for suspension in a physiological solution, and a lever arm was connected to the colon in 
such a way that any contraction was recorded as a polygraph trace. Id. The Board held that 
Nelson had not shown adequate proof of practical utility, characterizing the tests as "rough 
screens, uncorrelated with actual utility." Id. at 856. 

On appeal the C.C.P.A. reversed, holding that the Board "erred in not recognizing that 

tests evidencing pharmacological activity may manifest a practical utility even though they may 

not establish a specific therapeutic use." Id. The Court stated that "practical utility" was 

characterized as a use of the claimed discovery in a manner which provides some immediate 

benefit to the public, establishing the following rule: 

Knowledge of the pharmacological activity of any compound is obviously 
beneficial to the public. It is inherently faster and easier to combat illnesses and 
alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities. Since it is crucial to provide 
researchers with an incentive to disclose pharmacological activities in as many 
compounds as possible, we conclude that adequate proof of any such activity 
constitutes a showing of practical utility. Id. (emphasis added). 

The Court rejected Bowler's argument that the BP and GC-SMS tests are inconclusive 
showings of pharmacological activity since confirmation by statistically significant means did 
not occur until after the critical date. The Court stated that "a rigorous correlation is not 
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necessary where the test for pharmacological activity is reasonably indicative of the desired 
response." Id. (emphasis added). The Court concluded that a " reasonable correlation " between 
the observed properties and the suggested use was sufficient to establish practical utility. Id, at 
857. 

The sufficiency of a "reasonable correlation" in establishing utility was affirmed by the 
Court of Appeals for the Federal Circuit in Cross v. Iizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 
(Fed. Cir. 1985). In Cross, the subject of the interference before the Court was imidazole 
derivative compounds which inhibit the synthesis of thromboxane synthetase, an enzyme which 
leads to the formation of thromboxane A2. At the time the applications were filed, 
thromboxane A2 was postulated to be involved in platelet aggregation, which was associated 
with several deleterious conditions. Id. at 1042. 

The question before the Board and reviewed by the Court was whether Iizuka was 
entitled to the benefit of his Japanese priority application. Id. The Japanese application 
disclosed that the imidazole derivatives showed strong inhibitory action for thromboxane 
synthetase from human or bovine platelet microsomes, an in vitro utility. Id. at 1043. Relying in 
part on Nelson, the Board held that tests evidencing pharmacological activity may manifest a 
practical utility even though they may not establish a specific therapeutic use, and concluded that 
the in vitro tests were sufficient to establish a practical utility. Id. 

On appeal, Cross argued that the basic in vitro tests conducted in cellular fractions did not 
establish a practical utility for the claimed compounds, and that more sophisticated in vitro or in 
vivo tests were necessary to establish a practical utility. Id. at 1050. The Court rejected this 
argument, noting that adequate proof of any pharmaceutical activity constitutes a showing of 
practical utility. Id. The Court accepted the argument that initial testing of compounds is widely 
done in vitro: 

[I\n vitro results... are generally predictive of in vivo test results, i.e., there is a 
reasonable correlation therebetween. Were this not so, the testing procedures of 
the pharmaceutical industry would not be as they are. Iizuka has not urged, and 
rightly so, that there is an invariable exact correlation between in vitro test results 
and in vivo test results. Rather, Iizuka's position is that successful in vitro testing 
for a particular pharmacological activity establishes a significant probability that 
in vivo testing for this particular pharmacological activity will be successful. Id. 
(emphasis added). 
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The Court also noted that in previous decisions, its predecessor court had accepted 

evidence of in vivo utility as sufficient to establish practical utility. The Court reasoned that: 

This in vivo testing is but an intermediate link in a screening chain which may 
eventually lead to the use of the drug as a therapeutic agent in humans. We 
perceive no insurmountable difficulty, under appropriate circumstances, in finding 
that the first link in the screening chain, in vitro testing, may establish a practical 
utility for the compound in question. Successful in vitro testing will marshal 
resources and direct the expenditure of effort to further in vivo testing of the most 
potent compounds, thereby providing an immediate benefit to the public, 
analogous to the benefit provided by the showing of an in vivo utility . Id. at 1051, 
citing Nelson, 626 F.2d at 856 (emphasis added). 

Based on this reasoning, the Court affirmed the decision of the Board, stating that "based 
upon the relevant evidence as a whole, there is a reasonable correlation between the disclosed in 
vitro utility and an in vivo activity, and therefore a rigorous correlation is not necessary where 
the disclosure of pharmacological activity is reasonable based upon the probative evidence." Id. 
at 1050 (emphasis added). The Court therefore held that the disclosed in vitro utility was 
"sufficient to comply with the practical utility requirement of § 101." Id. at 1051. 

The holdings of Nelson and Cross were more recently affirmed in Fujikawa v. 

Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996). In Fujikawa, the Court again 

affirmed the notion that initial screens of compounds provide a practical utility even though they 

may not provide a therapeutic use because "'[i]t is inherently faster and easier to combat 

illnesses and alleviate symptoms when the medical profession is armed with an arsenal of 

chemicals having known pharmacological activities.'" Id. at 1564, quoting Nelson, 626 F.2d at 

856. The Court noted that it may be difficult to predict whether novel compounds will exhibit 

pharmacological activity, and consequently testing is often required to establish practical utility. 

Id. However the Court went on to state: 

But the test results need not absolutely prove that the compound is 
pharmacologically active. All that is required is that the tests be "reasonably 
indicative of the desired [pharmacological] response." In other words, there must 
be a sufficient correlation between the tests and an asserted pharmacological 
activity so as to convince those skilled in the art, to a reasonable probability , that 
the novel compound will exhibit the asserted pharmacological behavior." Id. 
(internal citations omitted, underline emphasis added, italics in original). 

On appeal, Fujikawa argued that Wattanasin had failed to establish an adequate 
correlation between the in vitro and in vivo results to permit Wattanasin to rely on positive in 
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vitro results to establish a practical utility. The Court stated that the Board relied on testimony 
from those skilled in the art that the in vitro results convinced the experts that the claimed 
compounds would exhibit the desired pharmacological activity when administered in vivo, 
including testimony that in vivo activity is typically highly correctable to a compound's in vitro 
activity in the field. Id. at 1565. To overcome this evidence and counter the Board's decision, 
Fujikawa pointed to the testimony of its expert that "there is a reasonable element of doubt that 
some elements may be encountered which are active in the in vitro assay, but yet inactive in the 
in vivo assay." Id, 

The Court rejected this argument: "Of course, it is possible that some compounds active 
in vitro may not be active in vivo. But, as our predecessor court in Nelson explained, a 'rigorous 
correlation' need not be shown in order to establish practical utility; 'reasonable correlation' 
suffices ." Id. (emphasis added). The Court also rejected Fujikawa's reliance on two articles. 
The Court noted that while one article taught that "m vitro testing is sometimes not a good 
indicator of how potent a compound will be in vivo, it does imply that compounds which are 
active in vitro will normally exhibit some in vivo activity." Id. at 1566. Similarly, the Court 
noted that the second article expressly stated that "[f]or most substances, although not for all, the 
relative potency determined in in vitro . . . parallels the in vivo activity." Id. 

The Court concluded that the facts in the case were analogous to the ones in Cross where 
the court relied on a known reasonable correlation between in vitro tests and in vivo activity, and 
therefore affirmed the Board's decision that Wattanasin had established a practical utility with 
the in vitro results. Id. at 1565-66. 

The Nelson, Cross, and Fujikawa cases are very similar to the present case. The 
reasoning of the courts in all three cases that "'[i]t is inherently faster and easier to combat 
illnesses and alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities'" applies to the asserted utility for the 
claimed antibodies. Fujikawa, 93 F.3d at 1564, quoting Nelson, 626 F.2d at 856; see also Cross, 
753 F.2d at 1051 ("Successful in vitro testing will marshal resources and direct the expenditure 
of effort to further in vivo testing of the most potent compounds, thereby providing an immediate 
benefit to the public, analogous to the benefit provided by the showing of an in vivo utility."). 
Like pharmaceutical compounds, nucleic acids, polypeptides, and antibodies which are 
associated with cancer will make it inherently faster and easier to combat cancer. The greater the 
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number of biological markers of cancer medical professionals have access to, the more accurate 
and detailed a diagnosis they can make. The determination that a gene is amplified and 
differentially expressed in cancer compared to normal tissue constitutes at least as significant a 
development in the field of cancer diagnostics as in vitro screening for pharmaceutical activity. 
See Cross, 753 F.2d at 1051 ("the first link in the screening chain, in vitro testing, may establish 
a practical utility for the compound in question. Successful in vitro testing will marshal 
resources and direct the expenditure of effort to further in vivo testing of the most potent 
compounds, thereby providing an immediate benefit to the public"). 

In addition, like in vitro tests in the pharmaceutical industry, those of skill in the field of 
biotechnology rely on the reasonable correlation that exists between gene amplification and 
mRNA and protein expression (see discussion supra). Were there no reasonable correlation 
between them, the techniques that measure gene levels such as microarray analysis, differential 
display, and quantitative PCR would not be so widely used by those in the art. See Grimaldi 
Declaration at ^[5. As in Cross, Appellants here do not argue that there is "an invariable exact 
correlation" between gene amplification, mRNA, and protein expression. See Cross, 753 F.2d at 
1050. Instead, Appellants' position detailed above is that a measured change in gene 
amplification and gene expression in cancer cells establishes a "significant probability" that the 
expression of the encoded polypeptide in cancer will also be changed based on "a reasonable 
correlation therebetween." Id.; see also Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' 
need not be shown in order to establish practical utility; 'reasonable correlation' suffices"); 
Nelson, 626 F.2d at 857 (holding that "a rigorous correlation is not necessary" and that a 
"reasonable correlation" will suffice). 

Also of importance is the Court's rejection of the notion that any in vitro testing must be 
statistically significant to support a practical utility. Nelson, 626 F.2d at 857. Likewise, 
qualitative characterizations of a test compound as either increasing or decreasing blood pressure 
was acceptable. Id. at 855 (stating that responses were categorized as either a depressor 
(lowering) effect or a pressor (elevating) effect). This is similar to the data in Example 16, 
where the gene amplification was not evaluated for statistical significance, although Dr. 
Goddard's unchallenged declaration makes clear that the data are reliable and significant. 

There are additional similarities. In Fujikawa, the Board and Court rejected the argument 
that there was no utility because there was no exact correlation between the in vitro and in vivo 
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results in spite of testimony and references establishing a lack of a "necessary" correlation. 
Fujikawa, 93 F.3d at 1565-66. Like the two references rejected by the Board and Court in 
Fujikawa, the references cited by the Examiner may suggest that the correlation between gene 
amplification and increases in mRNA levels and protein levels is not exact. But like Fujikawa, 
portions of the references also support Appellants' assertion, and Appellants have submitted the 
declaration of two experts in the field which state that those in the field rely on the correlation 
between changes in mRNA and protein. See Grimaldi Declaration at ^ 5; Polakis Declaration at 
If 6. Thus, as was the case in Fujikawa, although there may be some evidence that the correlation 
relied on is not exact, the declarations and numerous references submitted by Appellants are 
more than enough evidence to establish that there is a "reasonable correlation" between gene 
amplification, changes in mRNA levels and changes in the level of the encoded protein. 

In conclusion, Appellants have asserted that the claimed antibodies are useful for the 
diagnosis of cancer, particularly lung and colon cancer based on the data in Example 16. This 
utility is far beyond the nebulous expressions "biological activity" or "biological properties" 
rejected in In re Kirk, 376 F.2d 936, 153 U.S.P.Q. 48 (C.C.P.A. 1967). Like Nelson, Cross, and 
Fujikawa, Appellants have asserted a utility which relies on a reasonable correlation between the 
data disclosed in the application and the asserted utility. The fact that there may be limited 
evidence that the correlation is not exact does not invalidate Appellants' showing of utility since 
the correlation need not be a rigorous or exact one. Considering the relevant evidence as a whole, 
Appellants have provided sufficient evidence to establish a reasonable correlation between gene 
amplification, changes in the level of mRNA and corresponding changes in the level of the 
encoded polypeptide. Therefore the claimed antibodies have a practical utility as diagnostic 
tools for lung and colon cancer. 

9. Utility - Conclusion 

Appellants' asserted utility for the claimed antibodies as diagnostic tools for cancer 
corresponds in scope to the subject matter sought to be patented and therefore "must be taken as 
sufficient to satisfy the utility requirement of § 101 for the entire claimed subject." In re Langer, 
503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). The Examiner's unsupported 
arguments and references are not sufficient evidence to make a prima facie showing that "one of 
ordinary skill in the art would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 
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1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). And even if the Examiner has established a prima 
facie case, Appellants have offered sufficient rebuttal evidence in the form of expert declarations 
and references, which, when considered as a whole, establish that it is more likely than not that 
the asserted utility is true. See In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 
(Fed. Cir. 1992) (stating that the evidentiary standard to be used throughout ex parte examination 
in setting forth a rejection is a preponderance of the evidence, or "more likely than not" 
standard); M.P.E.P. § 2107.02, part VII ("evidence will be sufficient if, considered as a whole, it 
leads a person of ordinary skill in the art to conclude that the asserted utility is more likely than 
not true .") (emphasis in original). 

Finally, the courts' decisions in similar cases make clear that the evidence provided by 
Appellants is sufficient to establish the asserted utility. The evidence does not need to be direct 
evidence, nor does it need to provide an exact correlation between the submitted evidence and 
the asserted utility. Instead, evidence which is "reasonably" correlated with the asserted utility is 
sufficient. See Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' need not be shown in order 
to establish practical utility; 'reasonable correlation' suffices"); Cross, 753 F.2d at 1050 (same); 
Nelson, 626 F.2d at 857 (same). Considering the evidence as a whole in light of the relevant 
cases, the Board should find that Appellants have established at least one specific, substantial, 
and credible utility, and the Examiner's rejection of the pending claims as lacking utility should 
be reversed. 

C. Enablement Rejection - Detailed Arguments 

The second issue before the Board is whether Appellants have enabled the pending 
claims such that one of skill in the art would be able to make and use the claimed invention. The 
Examiner has rejected Claims 22-26 under 35 U.S.C. §112, first paragraph, arguing that the 
claimed subject matter was not described in the specification in such a way as to enable one 
skilled in the art to make and/or use the invention. Office Action at 7. The Examiner recites the 
factors for determining enablement from In re Wands, 858 F.2d 731, 8 U.S.P.Q. 2d 1400 (Fed. 
Cir. 1988), relying to a large extent on the arguments made above in support of the rejection for 
lack of utility. 

Appellants submit that Claims 22-26 are enabled such that one of skill in the art could 
make and use the claimed antibodies without undue experimentation. It is well within those of 
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skill in the art to make antibodies which are specific to a disclosed sequence, and the Examiner 
has not stated otherwise. As described above, Appellants assert that the claimed antibodies are 
useful as diagnostic tools for cancer, particularly lung and colon cancer based amplification of 
the PR0539 gene and the resulting overexpression the PR0539 polypeptide in lung and colon 
tumors. Based on the disclosure in the application, one of skill in the art would be able to use the 
claimed antibodies as diagnostic tools to detect overexpression of the PR0539 polypeptide in 
suspected lung and colon tumors without undue experimentation. 

1. Enablement ~ Lesal Standard 

An application enables the claims "if one skilled in the art, after reading the[] disclosure[], 
could practice the invention claimed ... without undue experimentation." Chiron Corp. v. 
Genentech, Inc., 363 F.3d 1247, 1253 (Fed. Cir. 2004). u But the question of undue 
experimentation is a matter of degree. The fact that some experimentation is necessary does not 
preclude enablement; what is required is that the amount of experimentation 'must not be unduly 
extensive.'" PPG Indus. , Inc. v. Guardian Indus., Corp., 75 F.3d 1558, 1564 (Fed. Cir. 1996) 
(quoting Atlas Powder Co. v. E.I. DuPont de Nemours & Co., 750 F.2d 1569, 1576 (Fed. Cir, 
1984)). 

While the application must enable one of ordinary skill in the art to practice the full scope 
of the claimed invention, "[t]hat is not to say that the specification itself must necessarily 
describe how to make and use every possible variant of the claimed invention, for the artisan's 
knowledge of the prior art and routine experimentation can often fill gaps, interpolate between 
embodiments, and perhaps even extrapolate beyond the disclosed embodiments, depending upon 
the predictability of the art." AK Steel Corp. v. Sollac, 344 F.3d 1234, 1244 (Fed. Cir. 2003). 

2. Enablement - Burden of Proof 

In order to make an enablement rejection, the PTO has the initial burden to establish a 
reasonable basis to question the enablement provided for the claimed invention. See M.P.E.P. § 
2164.04. A specification teaching how to make and use the claimed subject matter must be taken 
as being in compliance with the enablement requirement unless there is a reason to doubt the 
objective truth of the statements contained therein which are relied on for enabling support. Id. 
It is incumbent for the PTO "to explain why it doubts the truth or accuracy of any statement in a 
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supporting disclosure and to back up assertions of its own with acceptable evidence or reasoning 
which is inconsistent with the contested statement." Id. (quoting In re Marzocchi, 439 F.2d 220, 
224, 169 U.S.P.Q. 367, 370 (C.C.P.A. 1971). This can be done "by making specific findings of 
fact, supported by the evidence, and then drawing conclusions based on these findings of fact." 
Id 

3. Enablement — Standard of Proof 

Once the examiner has weighed all the evidence and established a reasonable basis to 
question the enablement provided for the claimed invention, the burden falls on the applicant to 
present persuasive arguments, supported by suitable proofs where necessary, that one skilled in 
the art would be able to make and use the claimed invention using the application as a guide. See 
M.P.E.P. § 2164.05. "The evidence provided by applicant need not be conclusive but merely 
convincing to one skilled in the art." Id. (bold emphasis added, underline in original). "A 
declaration or affidavit is, itself evidence that must be considered ." Id. (emphasis in original). 

The examiner must then "weigh all the evidence before him or her, including the 
specification and any new evidence supplied by applicant with evidence and/or sound scientific 
reasoning previously presented in the rejection and decide whether the claimed invention is 
enabled." Id. "The examiner should never make the determination based on personal opinion. 
The determination should always be based on the weight of all the evidence." Id. (emphasis in 
original). 

4. Appellants 9 Specification Teaches How to Make and Use the Claimed Subject 
Matter 

The claimed subject matter relates to isolated antibodies which specifically bind to the 
polypeptide of SEQ ID NO: 7, PR0539. The specification discloses how to make the claimed 
antibodies, including Example 27 on page 127 of the specification which specifically describes 
the preparation of antibodies that bind PRO polypeptides. Specification at 90, line 20 through 97, 
line 4, and 127, line 13, through 128, line 1. In addition, the specification discloses that the 
claimed antibodies can be used in diagnostic assays to detect the expression of PROS 3 9 in 
specific types of tissue. Specification at 98, lines 5-29. In light of the amplification of the 
PR0539 gene in a majority of lung and colon tumors tested, one of skill in the art would expect 
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the PR0539 polypeptide to overexpressed in these tumors as well Therefore, given the teaching 
in the specification on how to make and use the claimed antibodies to detect expression of 
PR0539 in specific tissues, one of skill in the art would be enabled to practice the claimed 
invention without undue experimentation. 

Because Appellants' specification teaches how to make and use the claimed subject 
matter, it must be taken as being in compliance with the enablement requirement unless there is a 
reason to doubt the objective truth of the statements contained therein which are relied on for 
enabling support. SeeM.P.E.P. §2164.04. Id. 

5. The Examiner's Arguments and Appellants 9 Responses 

The PTO has the initial burden to establish a reasonable basis to question the enablement 
provided for the claimed invention. See M.P.E.P. § 2164.04. It is incumbent for the PTO "to 
explain why it doubts the truth or accuracy of any statement in a supporting disclosure and to 
back up assertions of its own with acceptable evidence or reasoning which is inconsistent with 
the contested statement." Id. (quoting In re Marzocchi, 439 F.2d 220, 224, 169 U.S.P.Q. 367, 
370 (C.C.P.A. 1971). This can be done "by making specific findings of fact, supported by the 
evidence , and then drawing conclusions based on these findings of fact." Id. 

The Examiner has attempted to rely on the factors to be considered in determining 
whether a disclosure meets the enablement requirement as described in In re Wands, 858 F.2d 
731, 8 U.S.P.Q. 2d 1400 (Fed. Cir. 1988). Appellants state the Examiner's arguments regarding 
each factor and respond to each in turn below. 

a. The nature of the invention 

The Examiner argues that the invention is "in a class of invention which the CAFC has 
characterized as 'the unpredictable arts such as chemistry and biology.' Mycogen Plant Sci., Inc. 
v. Monsanto Co., 243 F.3d 1316,1330 (Fed. Cir. 2001)." Office Action at 7. 

While Appellants acknowledge that some aspects of biology can be unpredictable, the 
creation of specific antibodies to a disclosed antigen is not unpredictable. This is the very issue 
that was addressed in In re Wands, 858 F.2d 731, 8 U.S.P.Q. 2d 1400 (Fed. Cir. 1988). There 
the CAFC held that the disclosure was sufficient to enable one of skill in the art to make 
monoclonal antibodies to a disclosed antigen without undue experimentation. Id. at 740. If the 
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disclosure was sufficient at the time of filing of the Wands application in 1980, it cannot be that 
the art of making antibodies has become less predictable in the ensuing 25 years. Similarly, the 
use of antibodies to detect a target peptide such as PR0539 in a particular tissue is similarly 
well-established in the art. See Specification at 98, lines 5-18 and cited references which are 
incorporated by reference into the specification. 

Thus, contrary to the Examiner's assertions, the nature of the invention weighs in favor of 
enablement as it is a predictable and well-established aspect of biology. 

b. The breadth of the claims 

The Examiner argues that "[t]he claims broadly encompass not only a particular 
PRO-539 antibody but also include any antibody which binds the polypeptide of SEQ ID NO: 
7." Office Action at 7. 

The broadest pending claim is Claim 22, which recites "An isolated antibody that 
specifically binds to the polypeptide of SEQ ID NO: 7." Thus, only those antibodies which 
specifically bind the PR0539 polypeptide are encompassed by the claims. This is not a broad 
genus of antibodies, but is instead limited to a well-defined genus which binds a specific 
disclosed polypeptide. 

Thus, contrary to the Examiner's assertions, the breath of the claims weighs in favor of 
enablement. 

c. The quantity of experimentation 

The Examiner argues that the quantity of experimentation would be extremely large 
"since there is significant variability in the activity of polypeptides and nucleic acids. It would 
require significant study to identify the actual function of the PRO-539 protein and nucleic acid , 
and identifying a use for this protein and resultant antibody would be an inventive, unpredictable 
and difficult undertaking in itself." Office Action at 8 (emphasis added). 

As Appellants discussed above in response to similar arguments by the Examiner, the 
asserted use of the claimed antibodies does not rely on knowing the function of the PR0539 
protein or nucleic acid. Instead, the claimed antibodies can be used as diagnostic tools for cancer 
by detecting the overexpression of the PR0539 polypeptide in lung and colon tumors. This use 
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in no way depends on the function of the PR0539 protein, and therefore experiments to 
determine the function of PR0539 are unnecessary. 

In addition, as discussed above with respect to the nature of the invention, making 
antibodies which specifically bind to PR0539 does not require a great deal of experimentation, 
as the techniques are well-established in the art. See In re Wands, 858 F.2d 731 (reversing the 
Board's decision of non-enablement and holding that as of 1980, undue experimentation was not 
required to make high-affinity monoclonal antibodies to a target peptide). 

Thus, contrary to the Examiner's assertions, the quantity of experimentation is minimal, 
and this factor weighs in favor of enablement. 

d The unpredictability of the art and the state of the prior art 
Echoing the arguments regarding quantity of experimentation, the Examiner argues that 
"[t]he art is extremely unpredictable with regard to protein function in the absence of reliable 
information regarding the protein activity." Office Action at 8. The Examiner again cites Rost et 
al. 9 arguing that proteins with different sequences may have very different functions. He argues 
that because the function of the PR0539 protein is not known, "it is entirely unpredictable what 
function and activity will be found for this protein," and there is no prior art to resolve this 
ambiguity. Id. 

The Examiner then repeats his arguments in support of the utility rejection, stating that 
"the overexpression of the nucleic acid is not relevant to the utility of the protein" and that 
"[t]here is no evidence that the protein itself is overexpressed." Office Action at 8. The 
Examiner repeats verbatim the arguments made above regarding a lack of a "necessary" 
connection between gene amplification and mRNA and protein levels, citing the Meric et al, 
Gokman-Polar et al and Pennica et al references. Office Action at 8-10. The Examiner 
concludes by stating that "it is entirely unpredictable how one would use this antibody in any 
context whatsoever." Office Action at 10. 

Appellants have respond at length to these arguments above, and refer the Board to the 
discussion of the utility rejection supra. To briefly summarize those arguments, Appellants have 
demonstrated that the references relied on by the Examiner do not support his position, and 
instead support the Appellants' arguments. Appellants have provided substantial evidence in the 
form of additional data, expert declarations, and references which show that it is well-established 
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and predictable that the more than two-fold amplification of the PR0539 gene in lung and colon 
tumors will result in overexpression of the PR0539 protein in those tumors. Therefore given the 
disclosure in the specification, one of skill in the art would be able to use the claimed antibodies 
as diagnostic tools for lung and colon cancer. 

Thus, contrary to the Examiner's assertions, the prior art regarding the correlation 
between gene amplification and protein overexpression is sufficiently predictable that this factor 
weighs in favor of enablement. 

e. Working examples 
The Examiner asserts that the specification has no working examples that relate to the 
antibody or protein, and repeats his arguments that the data showing overexpression of the 
nucleic acid in certain cancer cell lines are not relevant. He argues that "there is no statistical 
showing that the overexpression of the nucleic acids is even significant" and "there is no 
showing that the results from nucleic acids have any correlation with the protein or antibody and 
the art cited above demonstrates that there is no presumption of such a correlation." Office 
Action at 10. 

First, Appellants have submitted the declaration of Dr. Goddard which states that the data 
in Example 16 are sufficiently reliable and significant such that the PR0539 gene can be used as 
a cancer diagnostic tool. See Goddard Declaration at f7. Appellants remind the Board that 
"The evidence provided by applicant need not be conclusive but merely convincing to one 
skilled in the art" and that " [a] declaration or affidavit is. itself evidence that must be 
considered ." M.P.E.P. § 2164.05 (emphasis in original). The Examiner has not presented any 
evidence or argument in response to the Goddard Declaration, and therefore Appellants have 
established that the data regarding amplification of the PR0539 gene is significant. As to the 
Examiner's argument that there is no correlation between the gene amplification data and the 
protein and antibody based on the three cited references, this argument has been addressed above. 

Appellants also note that contrary to the Examiner's assertion, Example 27 on page 127 
of the specification describes the preparation of monoclonal antibodies which can specifically 
bind PRO polypeptides. Specification at 127, line 13 through 128, line 1. In addition, the 
specification teaches how to use antibodies to a PRO polypeptide to detect its expression in 
specific cell or tissue types. Specification at 98, lines 5-18. 
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Thus, contrary to the Examiner's assertions, there are sufficient working examples related 
to the claimed antibodies that this factor weighs in favor of enablement. 



The Examiner summarily states that u [t]he specification provides no specific or 
substantial uses for the PRO-539 antibody or protein." Office Action at 10. 
Contrary to this assertion, the specification teaches that : 

[A]nti-PRO antibodies may be used in diagnostic assays for PRO, e.g., detecting 
its expression in specific cells, tissues, or serum. Various diagnostic assay 
techniques known in the art may be used, such as competitive binding assays, 
direct or indirect sandwich assays and immunoprecipitation assays conducted in 
either heterogeneous or homogeneous phases [Zola, Monoclonal Antibodies: A 
Manual of Techniques , CRC Press, Inc. (1987) pp. 147-158]. Specification at 98, 
lines 5-10. 

In addition, Example 16 which shows a more than two-fold amplification of PR0539 in 

lung and colon tumors teaches that: 

Amplification is associated with overexpression of the gene product, indicating 
that the polypeptides are useful targets for therapeutic intervention in certain 
cancers such as colon, lung, breast and other cancers and diagnostic determination 
of the presence of those cancers . Specification at 1 12, lines 0-3 (emphasis added). 

This claim is supported by substantial evidence discussed above with respect to utility. 
Together, with the level of knowledge in the art and the disclosure of how to make antibodies 
which are specific to PR0539, the specification clearly teaches that the claimed antibodies to 
PR0539 are useful as diagnostic tools for lung and colon cancer, and how such use can be 
accomplished. 

Thus, contrary to the Examiner's assertions, the specification provides sufficient 
guidance on how to use the claimed antibodies such that this factor weighs in favor of 
enablement. 

g. Level of skill in the art 
Appellants agree with the Examiner that the level of skill in the art is high, with the 
typical skilled artisan having a doctorate degree and several years of post-graduate work. Thus, 
this factor weighs heavily in favor of enablement. 



Guidance in the specification 
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6. Enablement - Conclusion 

For the reasons discussed above, all of the In re Wand factors weigh in favor of a finding 
of enablement. The Examiner has made no attempt to argue that one of skill in the art would be 
unable to make the claimed antibodies. The remainder of his arguments are essentially those 
made in support of the utility rejection which fail for the reasons articulated above. Therefore, 
the Examiner has failed to meet his initial burden to establish a reasonable basis to question the 
enablement provided for the claimed invention. See M.P.E.P. § 2164.04. The Examiner has 
failed to back up his assertions "with acceptable evidence or reasoning which is inconsistent with 
the contested statement [of enablement]." Id. (quoting In re Marzocchi, 439 F.2d 220, 224, 169 
U.S.P.Q. 367, 370 (C.C.P.A. 1971). 

And even if the Examiner has met his burden, Appellants have presented persuasive 
arguments, supported by the evidence discussed above with respect to utility, that one skilled in 
the art would be able to make and use the claimed invention using the application as a guide. 
Appellants remind the Board that "[t]he evidence provided by applicant need not be conclusive 
but merely convincing to one skilled in the art." M.P.E.P. § 2164.05 (emphasis in original). 

Considering all of the evidence provided by the Appellants to establish their asserted 
utility, along with the disclosure in the specification, the Board should find that Appellants have 
established that one of skill in the art would be able to make and use the claimed invention 
without undue experimentation, and the Examiner's rejection of the pending claims as lacking an 
enabling disclosure should be reversed. 

D. Conclusion 

In view of the arguments presented above, Appellants submit that the specification as 
filed provides a specific, substantial and credible utility for the claimed antibodies, and that one 
of skill in the art would be able to make and use the claimed antibodies without undue 
experimentation. Appellants therefore request that the Board reverse the Examiners rejections 
under 35 U.S.C. §§101 and 112. 
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VIII. APPENDIX A - CLAIMS ON APPEAL 

22. An isolated antibody that specifically binds to the polypeptide of SEQ ID NO:7. 

23. The antibody of Claim 22 which is a monoclonal antibody. 

24. The antibody of Claim 22 which is a humanized antibody. 

25. The antibody of Claim 22 which is an antibody fragment. 

26. The antibody of Claim 22 which is labeled. 
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IX. APPENDIX B - EVIDENCE 

Attached hereto is a copy of the evidence cited in Appellants' Brief. The list of evidence 
below is accompanied by a statement setting forth where in the record that evidence was entered 
into the record by the Examiner. 



Tab Reference 



Submitted 



Entered 



1 Declaration of Audrey 
Goddard, Ph.D. 



Originally submitted with 
Amendment and Response to 
Office Action mailed July 
15, 2004 at page 14 



Entered by Examiner in Final 
Office Action mailed 
September 8, 2004 



Benjamin Lewin, 
Genes V, 5th ed. 1994, 
pages 1196-1201 



Originally submitted with 
Submission Filed With RCE 
mailed March 7, 2005 as 
Exhibit 1 



Entered by Examiner in Final 
Office Action mailed April 25, 
2005 



Alitalo (Med. Biol., 
(1984) 62:304-317) 



Originally submitted with 
Submission Filed With RCE 
mailed March 7, 2005 as 
Exhibit 2 



Entered by Examiner in Final 
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Originally submitted with 
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Orntoft et al. 
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Amendment and Response to 
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Entered by Examiner in 
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December 20, 2004 



Hyman et al. (Cancer 
Research, (2002) 
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December 20, 2004 
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Molecular Biology of 
the Cell (3 rd ed. 1994) 
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Submission Filed With RCE 
mailed March 7, 2005 as 
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Entered by Examiner in Final 
Office Action mailed April 25, 
2005 
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Final Office Action mailed 
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December 20, 2004 



1 5 Benjamin Lewin, 

Genes VI (1997), pages 
847-848 
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IN THE UNITED .STATES PATENT AND TRADEMARK OFFICE 



In: re Application.of Ashkenaziet at 

SerialNor: 09/903,925 V 

Filed: July 11, 2001.. .. 

For SECRETED AND . ■■ * : 

TRANSMEMBRANE ,, 
POLYPEPTIDES AND NUCLEIC ' 
. ACIDS. - ; - 



Group Art Unit 1647 .. 
Examiner. Fozia Harriid 
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DECLARATI ON OF AUDREY D, GODDARD. Ph,D TINDER 37 Cj.R. S 1 .1 31 
Assistant Commissioner of Patents 

Washington, D.C. 20231 . .: . ' . 

sir : ."; ■' ' • ' • • : 

, - L Audrey D- Goddard. Ph.D. do hereby declare and say as follows: 

1. I am a Senior Clinical Scientist at the Epqwrjmental Medicine/BidOncol 
Affairs Department of Genentech, Inc., South Sain Francisco, California 94Q 80. 

. 2. ; . Betweeal993 and 200 1,1 headed the DNASequenemg^ 
Biology Department of Geaentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes wntnTjutingtotheoncogemc^^ 
of the chromosomal localization of novel genes. 

3, My scientific Curriculuin Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A) . , * 



I am femaiar with a variety qf technique knowni'the "art for detecting and 
: quantifying the amplification of bncogenesm cancer, including thb quantitative TaqManPCRCie., 
"gene ^amplification") assay described in me a1)bve captioned patent appticatioa 

5. The TaqMan PGR assay is described, for example, in me following scientific : 
pubMcations: ifiguchi et al, Biotechnology lokl 3-417 (1992) (Exhibit B); Livak et al^CR 
•Methods Appl;, 4:357-362 (1995) (Exhibit C),and Heid et. d/,, Genome Res. 6-986-994 {1996) . 
(ExhibitD). Briefly, the assay is based on the principle that siiccessmlPOl yields a fluorescent! 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a ^ fluorescehtly labeled 
otigonucleotide thatis homologous to a sequence between two' PCR. primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measWing the 
tacrementm fluorescence^^ Th^ is an extremely sensitive 
techmque, which allows detection hi the exponential phase of the PCR reaction, and, as a result,' 
leads to accurate determination of gene copy number. ' 

6. . The quantitative ^ 

successfully used to characterise' genes involved in cancer'development and progtessio* 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic signfficance. This use of ihe quantitative 
TaqMan PCR assay is exemplified by the following scientific pubtications: Pehnica et aL. Vroe. 
; |LatL Acad. Sci. ITSA. 95(25)-i4717-14722 ;<1998) (Exhibit E);. Pirti et o£ Namre 
396(6712);699.703 (1998) (Exhibit F) and Bieche etk, M^Xancer 78:661^666 ^ (1998X(Exhibit. 
0>,the first two.of which lam co-author. In particular, Pehnica efd/. have used the quantitative- 
TaqMan PCR assay to study relative gene ampufication of WISP and c-my ; c:m various cell lines, 
colorectal, tumors and normal mucosa. Pitti cf. a/, studied the genomic amplification of Vdecpy 
. receptor for Fas ligahd in lung and colon cancer, using the quantitative.TaqMan PCR assay. Bieche 
et at used the assay to study gene amplification in breast, cancer • 



v :.;0 ; :: : . • • v^-v^ v iiv, :r, -vvv >-•, 



Serial No*: * 
FUed;* 



.Of; 



7. . Personal: 
controV ^.^W^ide^ 

. number in a tumor tissue sample relative to a normal fa, non ^umor> sample ^ V significant and: . 
, usefminmatthede 

sample serves as a basi S :forusm^ -" 

; 

pathology, Accordingly,^ 

. TaqMah PCR assay in a tumor sample ni^'iw^jjj* useful as a marker for tie 
. of cancer, formomtonng^^ 

' theraW. . V.': . ;- 

8. . Ideclarefurfiiermatall^ 

. Uiat all statements made on Mormation and telief are befieved to be true;' I declare that mese. ' 




•.-•——.•« — - -~~.„ r ,.„ and the like so made are . 
puni^abie by fine or imprfeonment, or bouCunder Section 1001 ofZ Titie 1 8 of me United States 
Code, and that such willful ^statements may jeopardize the validity of the application or any 
patent issuing ihereon. .. • • 




. AUDREY D v GODDARD,>h.pJ 

Genentech, fnc. . ilOCongaSt. 

. 1 DNA Way - <y San Francisco; CA, 941 3T . 

.South Sari Frariclsco, CA, 9409Q . • 415,84.19154 - 

•650 225.6429; 41.5.819,2247 (mobile). - 

gpddarda@gene.com . . . agoddard@pacfceji.net 

PROFESStbNAL EXPERIENCE % < V v 

Genentech, Inc. - ; 1993-present 

South San Francisco, CA . 

2001 - present ; Senior Clinical Scientist : 
; Experimental Medicine /, BioOncblogy, Medical Affaire * 

.Responsibilities: . *'.'"'<'"'..'' . ■ ' ■ 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genehtech's clinical trials for translational research 

• Transtationaf research using clinical specimen and data for drug development and 
diagnostics J- .; • 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part HSubteam 

s Interests:;-- *\ ;'•"-• ' : 

. Ethical and legal implications of experiments with clinical specimens and data 
Application of phannatngenowi^ 

1998- 2001 Senior Scientist 

Head of the DNA Sequencing. Laboratory, Molecular Biology Department, Research 
Responsibilities: ; -i; 
Management of a laboratory of up to ntnetew~ihcluding postdoctoral fellow, associate 
\scientist, senior research. associate and research assistants/associate levels 

• Management of a $7SQKbudget f ^ .:«'..• 
- • DNA sequencing core facility supporting a 350+ person research facility, 
. " > DAW sequencing for high throughput gene discovery, - EST$ cDNAs, and constructs 

: * Genomic sequence analysis and gene identification 
. DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 

••' Identification -and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic t)NA sequence analysis 

• Candidate gene prediction and evaluation 
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1993-1398 Scientist : -; - ...V, •' 

Head of the DNA Sequencing Laboratory, Motecuiar Bfolqgy Dfepartrnent, Research ' ; . 
Responsibilities \ .'.v. ' '" ' v \ 

. • DNA sequencingcore facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into ffleenjnttoducing a level of middle management and additiq : 

• Participated in the development of the basic plan for high throughput secreted protein V 
discovery program - sequencing strategies, date analysis and packing,, database design / 

• High throughput EST and cDNA sequencing for new gene identification. ... 

• design and implementation of analysis tools required for high throughput gene identification. 
+ Chromosomal focalizatlon pf genes encoding novel secreted proteins. 

Research:. • \ '.. ••\ . -.' * 1 

• Genomic sequence scanning for new. gene discovery. , 
> Development of signail peptide selection methods. 

• Evaluation of candidate disease genes. 

• Gmvtthhonponemceptorgene.SNPsincM 

Imperial Cancer Research Fund . . . \ 1989-1 992 

London, UK with Dr. Ellen Solomon - - . 

6/89 -12/92: Postdoctoral Fellow 

• Cloning and characterization, of the genes ftised at the acute promyelocyte leukeimiia » 
translocation breakpoints on chromosomes 17. and 15. : ' 

* Prepared a. successfully funded European Union multi-center grant application 

McM aster University 1983 
Hamilton, Ontario, Canada with Dr. G. p. Sweeney 

5/83 - 8/83: NS ERG Summer Styderit ; .'■ 

• In vrtro metabolism of 0-naphthoflavone In C57BI/6 J and DBA mice 

' .- f' . ■ • ' : ' / . - : • . 

EDUCATION ; 

* . * . . • ■ *•"••„• , * * * * . » * - . 

"Phenotypic and genofypic effects of mutations in .VoroS^ntoS°CaWarf9 " mm 

. Supervisor: Dr. R. A. Phillips . Biophysics. 

Honours B.Sc '. . McMaster University, 

"The in vitro metabolism of the cytochrome P-448 Hamilton,. Ontario, Canada; 1983 
• inducer p-naphthoflavone in C57BL/6J mice," Department of Biochemistry 
Supervisor: Dr. G. D. Sweeney 



. ,Q, . .. . ......... O . ■ . 

ACADEMIC AWARDS • f. .\ V \-'\' ? ' ;.; y ' '.' . 

Imperial Cancer Research Fund Postdoctoral. Fellowship " \ -'taugAaA '•' 

..Medical Research Council Studentship-' -1983-1988 

NSERC Undergraduate Summer Research Award 1983" ' 

'Society of. Chemical Industry Merit Award (Hons. Btochem.) • : .V- 1983 

. Dr. Harry Lyjnan Hooker Scholarship : • ' ' ." ioai iqaV 

., J;LW. Gill Scholarship •-.•»• -' :.?5eJ3K 

. Business and Professional Women's Club Scholarship . \ 1980-1981 '•' 

• WyerhauserFoundatiqn Scholarship _ • .1979-1980 ' 



IMVITED PRESENTATIONS 



- Gehentech's gene discovery pipeline: High thrbughput idehtification, cloning and : 

charaaerization of. novel genes, functional Genomics: From Genome WFuncHoh Litchfield ' 
. Park, AZ, USA October 2000 ..' * • ." 

, High throughput identification, cloning and. characterization of novel genes. G2K:Back to 
iroo^ 06 ' M - nc f? ' n Genome Biology and Technology. R Marco Island; FL.HJSA February 

Quality, cpntrbl in DNA Sequencing: The use of Phred and Phrap, Bay: Area Seouendria 
. Users Meeting,. Berkeley. CA USA April: 1999 , ;. ... ■ - 

High throughput secreted protein identification and cloning. Tenth International Genome 
. ^Sequencing and Analysis Conference/Miami, FL, USA. .September 1998 

. The evolution of DNA sequencing: The Generitech perspective. Bay Area Sequencing Users 
Meeting, Berkeley, CAUSA. May 1998 . • .. • ; ., 

Partial Growth.Hormone InsensitMty: The role of GH-receptor mutations. in Idiopathic Short 
Stature. Tenth-Annual National Cooperative Growth Study Investigators Meetina San • 
Francisco, CA USA October^ 1996. .*.:•;:■••■'.' '. 

Growth hormone (Gf# receptor - defects are present in selected chirdfen with norHBH-deficlent 
: short stature: A rholecular basis for partial GH-lnsensjtivity.. 76 tt \Annual Meeting of The - 
Endocrine Society; Anaheim, CA, USA June 1994 ' . : 

A previously uncharacterized gene, myl. is fused to the retinoic acid receptor alpha gene in 
acute, promyelocyte leukemia. XV International Association for Comparative Research on : 
Leukemia and Related Disease, Padua. Itaiy, October 1991 - 
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PATEMTS 



Goddard A, Godowski PJ, GurneyAL. NL2 Tie ligandhomologue polypeptide Patent 
Number: 6,455,496. Date of Patent Sept 24, 2002- 

Goddard A, Godbwski PJ and GurneyAL. NL3 Tie ligandhomologue. nudeic acids Patent 
Number;.6,426,2t8. Date of Patent July 30;. 2002.' : . .: : - 

Godowski P, . Gumey A, Hillan KJ; Botstein D, Goddard A, Roy M, Feirara N. Tumas D. 
SchwalkR. NL4 Tle .ligand homologue nudeic add. .Patent Number 6 4137 770 Date of 
Patent July2 t 2002: . •; ' . . . • • • 

.Ashkenazf A Fong S, Goddard A Gumey AL, Napier MA, Tumas D, Wood Wl. Nudelaadd 
encoding A-33 related antigen poly peptides. Patent Number 6,410,708. Date- of Patent- 
Jun. 25, 2002. ' ■ ' " .' • - . "■ 

Botstein. DA Cohen RL, Goddard AD, Gumey AL, Hillan .Kj; Lawrence DA,. Levine AJ 
Pennica D,j Roy MA and Wood WL WISP polypeptides, and nucleic adds encoding samel 
Patent Number. 6,387,657. Date of .Patent May 14,. 2002. ...... ...... - , •'. 

Goddard AV Godowski PJ and GurneyAL. Tie iigands. Patent Number- 6,372;491 • Date of 
' Patent April 16. -2002: ' . . 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand hornologue antibody Patent 
Number 6,350,450. Date of Patent Feb. 26, 2002. . 

Forig: S. Perrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
. receptor tyrosine kinase ligand homologues. Patent Number 6,348,351. Date of Patent" \ 
Feb: 19/2002. "... •> 

Goddard A Godowski PJ and GurneyAL! Ligarid homologues. Patent Number. 6,348:350 
Date of Patent: Feb, 19 r 200Z. 

.Attie . KM, Carisson LMS, Gesundheit N and. Goddard A . Treatment of partial, growth 
hormone insensitivlty syndrome:. Patent Number 6,207,640: . Date of Patent March 27 

200,1... •. •" ' :• •:' 

Fong S, Ferrara N, Goddard K. Godowski PJ, Gumey AL, Hillan K and Williams PM: Nudeic 
adds encoding NL-3. Patent Numben.6,074,873. Date of Patent .June 13,. 2000 

. .Attie K, Carissbn. LMS, Gesunheit N and Goddard A Treatment of partial growth hormone 
ihse'hsitivity syndrome; Patent Number 5,824,642. Date of Patent October 20, 1 998 

Attie K, Carlsson LMS/ Gesunheit N and Goddard. A. Treatment of partial groWth hormone 
insensitivity syndrome. Patent Number. 5,646,1 13. Date of Patent: July 8,; 1997 

Multiple additional provisional applications filed 
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PUBLIJQAT10NS ; : ^ • ' ^'-.y' 

; Seshasayiee pV Dawd p; Gu Q, Enckson Goddard AD- Comparative sequence analysis of, 4 
the HER2 locus in mouse and man;. Manuscript in; pre^ 

; AbUzzahab MJ; Goddard A Grigorescu F, Laiutier C, Smith RJ and Chemausek SO.. Human 
, IGP-1 receptor mutations resulting in pre^ and post-natal growth retardation. Manuscript in 
preparation. ,* ; ■ . \- 

. . Ajggarwal S, Xie; M-H,.Foster J, Frantz G.Stinson J f Corpuz RT, Simmons L^Hillan K, 

Yansgra DG,.Vandlen RL, Goddard AD.and GUrneyAL. FHFR, a novel receptor fpr the . 
.v ^ fibrtiblast.growth factors: Manuscript submitted. .... 

Adams SH/Chui C, Schilbach.SL, Yu XX, Goddard AD, Grimaldi JC, Lee J/ Dowd P f Cblman 
. S. f Lewin DA. (2001) BFU, a unique aeyl-CoA thioesterase induced in thermogenic brown 
adipose fissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal Z6Q:^ . . > r. m . - ^ 

Lee X Ho. WH. Maruoka M. Corpuz RT. Baldwin DTI Foster JS. Goddard AD. Yansura DG/ - 
Vandlen RL. Wood Wl. Gumey AL; (2001) IL-17E, a novel proinflammatory ligand fpr the IL- 
1.7 receptor hbmolpg IL-17Rh1 . Journal of Biological Chemishy 276(2): 1660-1664. O 

;■ Xie MvH, Aggarwaf S, Ho W-H, Foster J, Zhang Z, Stinsdn J, Wood Wl, Goddard. AD and 
* GurneyAL (2000). Interleukin (IL)-22, a novel human cytokine that signals through'the 
• Jnferferon-receptor related proteins CRF2-4 arid IL-22R. Journal ot Biologicetf Chemistry 275- 

: ;/3fl335-3133k\'. . :''./. /■•. > 

Weiss.GAi Watanabe CK, Zhong A, Goddard A arid Sidhu SS: ; (20(X)j Rapid mapping of 
. protein functional epitopes by combinatorial alanine scanning: Proc. NatL AoacL ScL USA 97 : 
895(^8954. ^ • - " 

.. Guo S f Yarpaguchi Y f Schiibacfi S, Wacte T ? ;.Lee J t G oddard A French D , Handa H, 
Rosenthal A (2000) A regulator of transcriptional elongation controls vertebrate neuronal 
. development.. A/afune 408: 366-369. .* ; . 

Yan M t Wang L-C, Hymowitz SG, Schilbach S» Lee J, Goddard A, de Vds AM, Gao WQ, Dixit. 
; - : VM; (2000) Two-amino add molecular switch in an epithelial moiph^ 
binding to two distinct receptoi^..Sc/ence2a0c: 523r527. , 

Sehl PD/Tai JTN; Hillan KJ f Brown L/C Goddard A, Yang R, Jib H and Lowe DG. (2000). 
Application pf cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. C//x?u/affon 101: 1990-1999.. 

: Guo S, Brifch J, Teraoka H, Goddard A,. Wilson SW. Mullins MC ahcl Rosenthal A (1999). . 
Development of noradrenergic neurons in the zebrafishhiridbrain requires BMP, FGF8, and 
the homeodomain proteih soulless/Phox2A„ Neuron 24; 555-566! . 

Stone D^Murone, M, Luoh v S, Ye W, Armanini'P;.Gurney A, Phillips HS, Brush, j ; , Goddard 
A,.de Sauvage FJ and Rosenthal A..(1999) Characterization of the itiuman suppressor of ■* 
. .fused;, a negative regulator of the zinc-finger transcription factor GIL Ji. Cetl Sci: 112: 4437- 

•\ Xie M-H, Holcomb I, Deuel B, D6wd;P, Huang A, Vagfe A.Joster J; Liang J, Brush J, Gu Q, 
Hillan K, Goddkrd A and Gurney, A:L (1999) FGF^19; a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735., 
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YanM, Lee j.Schilbach i S, Goddard A and DbdtV. (1999) mE10,anovei.caspase: - 
recnjitment'domairv<»ntainihg proapoptotic molecule. J. Biof.Chem. 274(1 5): 10257.-i.0292. 

Gumey AL,.Mapsters SA, Huang RM,.Pitti RM,. Mark DT, Baldwin! DT, Gray AM- Dowd P 
Brush. J, Heldens S ; Schow P. Gdddard AD, Wood Wl, Baker KP, Godowski PJ and : . 
Ashkenazi A (1999) Identification of a new member of the tumor necrosis factor family and Its 
receptor, , a human ortholog of mouse GITR: C^oentB/d/ogxS^'.alS-SlS. 

Rldgway JBB, Ng E^Kern JA ,Lee J, Brush J, Goddard A and Carter P. (1999) Identification ; - 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nootumor cell lines. Cancer Research 59;. 2718-2723: 

; Pitii RM, Marsters SA Uwrence DA Roy M; Kischkel FC, Dowd P, Huang A, Donahue Cj; 
..Sherwood SW ; Baldwin: DT, Godowski PJ- Wood Wl, Gumey AL; Hilian KJ, Cohen RL, ' 
Goddard AD.Botstein D and Ashkenazi A (1998) Genomic amplification of a decoy receptor, 
for. Fas ligandin lupg and colon cancer. Nature 396(671 2): 699-703. : . 

. Pennica D, SwansonTA Welsh JW, Roy MA Lawrence DA Lee J, Brush J. Taneyhilt LA 

• Deuei B.Lew M, Watanabe.C* Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, : 
Hilian KJ, Gumey AL, BotsteinD and Levlne AJ. (1998), WISP genes are members of the V • 
connective' tissue growth factor family that are up-regulated in wnVt -transformed cells and 
aberrantly-expressed in human, colon tumors. Proc. Natt. Acad: ScK VSA: 95(25V: 14717- 

•14722:'. ■ % - J .:_: •. : :V ■■■ . • . • : 

. Yang'RB, Mark MR, Gray A.Huang A,Xie MH, Zhang M, Gdddard A Wood Wl, Gumey AL' 

• and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular' 
"signalling. Nature 3950699): .284-288. 

•Merchant AM,: Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P! (1998) An • 
"effluent route to huma^ 

Marsters SA, Sheridan JP,PittiRMi Brush J, Goddard A and Ashkenazi A (1998) . 
Identification of a ligand for the death-dOmain-contaihihg receptor Apo3. Current Biology 8(9): 
. 525r528: • 

>Qe J, Murone M 8 Luoh SM, Ryan A, GuQ. Zhang C, Bonifas JM; Lam.CW, Hynes M; . 
. Goddard A, Rosenthal. A Epstein EH Jri and de Sauvage. FJ. (1 998) Activating Smoothened 
mutations in'sporadlc basaNcelJ carcinoma. Wafure. 391(6662): 90-92: 

Masters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin! Di. Yuan J, Gumey A*' ' 
' Goddard AD, Godowski P .and. Ashkenazi A. (1997) A novel receptor, for Apo2LTFRA.IL 
contains a truncated death domain. Current B/o/ogy. 7(12): 1003-1006. 

Hynes Mi Stone DM, Dowd My Pitts-Meek S, Goddard A, Gumey A and Rosenthal A (1997) 
Control of cell pattern in .the neural tube by the zinc finger transcription factor Gli-1 . Neuron 
19: 15-26: v. •. '••• 

Sheridan JP, Marsters SA, PittrRM, Gumey A., Skubatch M. Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl; Goddard AD,. Godowski P, and Ashkenazi A (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
. (5327): 818-821. . . . • -....: 



GoddardAb; DowdP, Chemai^ek S, Geffner M, Gertner J, HintzR, Hopwood N f Kaplan S; 
Ptotriick L, RogoFA, Rosenfidd R; Saengef P, Mauras N, Hershkopf R r Angulo. M and Attie, 
. (1997) Partial growth hormone Insensltivity: The role of growth hormone receptor mutations in 
: Idiopathic short suture. X Pecfofc .m: S51-55.- . . : 

. Klein RD, Sherman D. Ho WH; Stone O, Bennett GL, Moffat B ; Vandlen R f . Simmons L, ; Gu Q v 
Hongo JA, Deyaux.B, PoulsenX Anrianini M f Nozaki C, Asai N, Goddard A, Phillips H, 
Henderson CEi TaKahashi W and Rosenthal A. (1997) A GPMinked protein that interacts with 
Ret to form a candidate netirturin receptor. Nature. 387(6634): 717t21 . 

Stone DlA f Hyries Mi ArmanIM tyL Stoahson TA, Gu Q; Johnson* RL, Scott MP, Pennica D; 

Goddard A, Phillips H,.Noll M f Hooper JE f de.Sauvage Fand Rosenthal A. (i 996) The 
. tamour-suppressor gene patched: encodes a candidate receptcirfor Sonic hedgehog Nature 
.384(6605): 129-34.; >/ - 

.'. Marsters SA, Sheridan JP, Donahue CJ, Pitti RM,.6ray CL f Goddard AD; Baiier KD and 
Ashkenazi A..(1996) Apo*3. a new member of the tumor necrosis factor receptor family, . 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3 9 these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyline repressible promoter, and («) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q243. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISPS 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3/3 (GSK-30) resulting in an increase in 
jS-catenin levels. Stabilized /3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
i8-catenin levels (9). APC is phosphorylated by GSK-3/3, binds 
to 0-catenin, and facilitates its degradation. Mutations in 
either APC or /3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3 f a member of 
the transforming growth factor (TGF)-/3 superfamiry, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwri), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISP-3. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778, AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 ptg of poly(A) + RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 fig 
of poty(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISPS were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation fluorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2 (Act) where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
a-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The tfTSP-specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 



mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1A and£). Wnt-4, unlike Wnt-l, v does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/ Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ^40,000 (M T 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 24). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of ~ 27,000 (M T 27 K) (Fig. 2B). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 

C57MG 



Parent 



Wnt-1 



Wnt-4 



WISP 1 - 




Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 /xg) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse OT.SP-7-specific probe 
(amino acids 278-300) or a 190-bp WISP-2-spccific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human /3-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1. 

Identification of WISPS. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISPS cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3A). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISPS are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N- terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fig. 3. (/I) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 





Fig. 4. (/4, C, £, and G) Representative hematoxylin/ eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and B), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and F), expression of WISP-2 is seen in cells tying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28), Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISP-3 mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 






Genomic DNA 



Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 /xg) 
digested with EcoRl (WISP-1) ovXbal (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. Hie relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to > 25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 



100 1 



10-t 



2 
cd 

i 

.2 



& 



1 



6 
a 

CO 



0.1 
10 

1 

0.1 
0>01 
0.001 



WISP-1 



XL 



n ft .. n n n n 



7 



100 -b 



10 



0.1 



Jl 




¥ 



u g 




1 



11 



T 




WISP-3 



82 30 64 6? IB C3 1W 38 120 164 146 184 210 212 208 17 30 2lS 78 
M 81 82 82 82 B2 82 B2 82 B2 B2 B2 CI C2 C2 C2 D DO 0 

Patient #/Dukes Stage 

Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., /3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-1 -transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through jS-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v p 3 serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-/31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and j3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. 
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Baker for technical assistance, P. Dowd for radiation hybrid mapping, 
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methods Peptides AENK or AEQK were dissolved in water, made isotonic with 
HaQ and diluted into RPMI growth medium. T-ceU-proliferadon assays were 
done essentially as described* 11 . Briefly, after antigen puking (30p.gmT ! . 
tTCF} with tetrapeptides (l-2mgnjr'), PBMCs or EBV-B ecus , were 
washed in PBS and fixed for 45 s in 0.05% glutaxaldehyde. Glycine was added 
to a final concentration of 0.1M and the cells were washed five times in RPMT . 
1640 medium containing 1% FCS before co-culture with Trcefl clones in 
round-bottom 96-wefl micro titre plates. After 48 h, the cultures were pulsed 
with 1 u€i of ^-thymidine and harvested for scintillation counting 16h later. 
Predigesrjon ofnativeTTCFwas done by incubating 200 |*g TTCF with 0.25 jig 
pig kidney legumain in 500 uJ 50 mM citrate buffer, pH 5.5, for i h at 37 *C 
GlycopoptWe digestions. The peptides HlDNEEDI, HlDNCN-gIucosarnine) 
EED1 and. H1DNESD1, which are. based on the* TTCF sequence, and.. 
QQQHUK3SblVTDCSGNFCLFR(KKk), which isbased on human transferrin^ 
were obtained by custom synthesis* The three. C-tenninal lysine residues were - 
added to the natural sequence to aid solubility. The transferrin glycopeptide. 
QQQHIJGSNVTDCSGNFGLFR was prepared by cryptic (Promega) digestion, 
of 5 mg reduced, caiboxy-methylated human transferrin followed by 
concanavalin A chromatograpEy >I ..Clycopepddcs corresponding to residues 
622-642 and 421r-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-termmal sequencing. The ryophilized transferrin- 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
ditruothreitol, 20% methanoL Digestions were performed for 3 fa at 30 °C with 
5-50 mUmT 1 " pig kidney legumain or B-ceU AEP. Products were analysed by 
HPLC or MAIDI-TQF mass' spectrometry using a matrix of lO.mg raT 1 a- 
cyanocmnamic acid in 50% acetomtrue/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or. reflector mode. Internal standar- 
dization was obtained witji a matrix ion of 568.13 mass units. 
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Fas ligand (PasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells, through the death receptor Fas/Apol/CD95 (re£. 1). 
One important role of FasL and Fas is to mediate immune- 
cytotojdc killing of cells that are potendaBy harmnsl to the. 
organism, such, as virus-infected or tumour cells 1 . Here .we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplilied in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed In malignant tissue. Thus; certain tumours 
may escape FasL-dependent inunune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. . : . • 

By searching expressed sequence tag (EST) databases, we identi- 
fied a. set of related fiSTs that showed homology to the tumour 
necrosis factor (TNFl receptor (TNFR) gene superfeuTuly 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this ci)NA decoy receptor 3 (l?cR3 j. The cDNA 
encodes a 300-arnino-add polypeptide that resembles members of 
the TNFR femfly (Fig, la): the amino- terminus contains a leader 
■ sequence, which is followed by four tandem cysteine-rich domains 
(Ca^s)'.iiteon^^ (OPG)V 
DcR3 lacks an" apparent transmembrane sequence, which indicates 
that it may be a secreted; rather than a . membrane-asscodated, 
molecule. We expressed, a recombinant, liistidine-tagged form of 
t)cR3 in marnmalian cells? DcR3 was secreted into tfce cell culture 
medium, arid migrated on potyacrylami.de gels as a protein of* 
relative molecular mass- 35,000 (data not shown). DcR3 shares 
. sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%).. All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carbdxy-terminal portion of. DcR3' is 101 residues 
. shorter. * 
• We* analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). W> detected a predbniinant l^-kilobase 
transcript in fetal lung, brain, and liver ? and in adult spleen, colon 
. and lung; In addition, we observed relatively, high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

. To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested Binding of 
DcR3-Fc to human 293 cells transfected with individual TNF- 
family ligands, which are expressed as type 1 transmembrane 
proteins (these transmembrane proteins have their N tenuini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL* (Kg. 2a), but not to cells transfected with 
TNF 5 , A^oZL/TRAIL*" 7 , ApoSLiTWEAK 8 -',. or OPGimANCE/ 
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IW^KL 1 ^ 12 {data not shown)'. ^DcR3-Fcinununoprcdpitated'shed 
hsL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
BasL* (Fig* 2c), as did the 'Fc-tagged ectodoinain of Fas but not 
7NFRl.:GeJ-fiItration chromatography showed that DcR3-Fc and 
soluble FasL formed a . stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a. comparable affinity (JC d = 0.8 i 0.2 and 
M ±0.1nM, respectively? Fig. 2e), and that DcR3-Fc could 
tfodc nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL 

To determine whether binding of DcR3 mhibits FasL activity, we 
tested, the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas,(Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble- FasL^induced apoptosis in a 
similar dose-dependent maimer, with half- maximal inhibition at 
—0.1 ugmT 1 . Time-course analysis showed that the inhibition did 
'not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3^Fc on actfvation- 
induced cell death (AICD) of mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results 13 , activation 
of interleukin-2-stimulate4 CD4-positivc T cells with anti-CD3 
antibody increased, the level xif apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3~Fc blocked the 



induction of apoptosis to a similar extent Thus* DcR3 binding 1 
blocks apoptosis induction by FasL. " 

FasL-induced apoptosis is important in.' elimination of virus- 
infected cells and cancer cells by natural kiDer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perform and 
granzymes M4 .- l \ Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d)rDcR3-Fc, 
and Fas-Fc each reduced killing of target cells from —65% .to 
—30%, with half-maximal inhibition at —1 u-gmT l ; the residual 
killing was probably mediated V^ perform/gran2yrne pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fcand Fas-Fc concentrations were required 
to block natural killer cell activity compared with those, required to 
.block soluble FasL. activity, which is consistent with. the greater 
. potency of membrane^assbciated FasL compared with- soluble 
"FasL 17 . .. . " • -J - ■ . 

Given the role of irnmune-cytotoxic cells in. eliminatibn of 
tumour cells and the feet jhat DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immime-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesisj we 
investigated whether . the DcR3 gene is amplified in cancer*. We 
analysed DdO gene-copy number by quantitative polymerase chain 
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Figure 1 Primary structure and expression of human DcR3. a; Alignment of the 
Bmlno-acW sequences of CteR3 and of csteoproteaerln (OPG); the Oterminal 101 
residues of OPG ere nqt shown. The putative signal cleavage site (arrow), the 
cystelne-fteh domains(CRD 1-4},and the A/-6nkedgrycosyiatlon site (asterisk) ere 
shown, b, Expression of DcR3 mRNA. Northern hybridization analysis was done . 
using the DcR3.cDNA as -a probe and blots .of poMA)* RNA (Ctontech) from 
human ' fetal, find adult tissues or cancer cefl fines. PBU peripheral blood 
lymphocyte. 



Figure 2 Interaction of DcR3 with FasL * 293 caOs were trensfected with pRKfr 
vector (top) or with'pRKS encoding fuWength- FasL (bottom), incubated with 
0cR3-Fc (solid line, shaded erea);TNFfli-Fc (dotted line) or buffer control 
(dashed line) (the dashed and dptted tines overlap), and.analysed for binding by : 
FACSi Statistical analysis showed e slgniflcantdlfference (P < 0D01) between the' 
binding of DcR3rFc to cells trensfected with FasL or pRK& PE ctycoeiythrin- 
labelled cells, b. 293 cells were transfectedealn a and metabolfcaily iebelied,and 
cell supematents were Immune-precipitated with Fcrfagged TNFR1; DcR3 or Fas. 
C Purified soluble FasL (sFasL) wasimmunoprecipitated with TNFR1-Fc DcR3- 
Fc or Fas-Fc and visualized by Immunobkrt wfth eriti-FasL antibody. sFasL waa. 
. loaded directly for comparison in the right-hand lane, d, Rag-tagged sFasL was 
incubated with 0cR3-Fc or with buffer and' resolved by gel filtration; column 
fractions ware analysed in an assay that detects complexes conta(ning DcR3-Fc 
and sFasL-Flag/e. Equilibrium binding of DcR3-Fc of Faa-Fc.to sFast-Hag:' 
Inset, competition of OcR3-Fc with Fas-Fc for binding to sFasL-Fteo/. 
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reaction (PCR) 18 in genomic SNA from 35 primary lung and colon 
tumours, relative to pooled, genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of i* lung tumours 
and. 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold- (Fig. 4a» b)l To confirm this result, we 
analysed the colon tumour DN As with three more, independent sets 
• of DcR3-basei PCR primers, and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization; We' detected DcR3 expres- 
sion in 6 out of 15 luagtumour's, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour, (data not shown). A 
section through a squamdus-cett carcinoma of the lung is shown in 
Fig. 4c DcR3. raRNA was focalized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma,, indicating 
tumour-spedfic expression* Although the individual tumour sped- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DeR3 gene is amplified frequently in tumours. 
SW4S0 colon carcinoma cells,; which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PGR (fourfold) and by Southern blot, 
hybridization (fivefold) (data not snown). • 

If E>cR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more- than neighbouring genomic 
regions that are not important for tumour survival. To test this, 



we mapped the human DcR3' gene by mdiation^hybrid analysis; 
DcR5 showed linkage to marker AfM218xe7 (T160), which maps to 
chromosome.position 20*13;, Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic done that 
carries DcR3, and sequenced the ends of the doners insert We then 
ckterrained, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the Copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Kg; 4d) The 
;DcR3^1inked reverse marker showed an average amplification of 
rougher threefold,, slightly fess than the approximately fourfold 
amphfication of DcR3; the other markers, showed little or no 
amplification. These data indicate that DcR3 may be at the Epi- 
centre* of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with, the possibility thatDcR3 ^amplification 
promotes tumour survival. 

OurriultssWmatDcR^bmdsspecificallym 
FasL activity. We did not detect DcR3 binding to several other TNF- 
Kgand-fenuly members; however, this does not rule out the possi- 
bdity that DcR3 interacts with other ligands, as .do some other 
TNFR family members, including OPG u *. 

FasL is important in regulating the immune response; however 
hole is known about how JasL fanction is controfled. One mechan- 
ism involves the molecule cFIlP, which modulates apoptosis signal- 
Img downsueam of Eas?. A second mechanism involves proteolytic 
shedding of EasL jromthe r^surfAcc". DcR3 competes with Fas for 
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figured Inhibition of FasL actMty by DcRa 8,,Human JurkatT leukaemia cells 
•were incubated with Rag-tagged soluble. FasL (sFast; Sngmr 1 ) oligomerized' 
with entHRag antibody (6.1 jig mr') in the presence of the proposed inhibitors 
DcR3-Fc. Fas-Fc or human IgGl antf assayed for apoptosia (mean ± s.e,m. of 
triplicates). b.Jurkat cells wereMcubated with *sFa$L-Flag plus anti-Flag antibody 
as in a, in presence of 1 ngmr' OcR3-Fc (fiOed circles); Fas-Fc (open circles) or 
human IgGl (triangles), andJapoptosts was determined at the indicated time 
points, c Peripheral btoodTcefls were stimulated wftrj PHA and imerieukln-2. . 
followed by control (white bars) or antK503 antibody .(fiJied bars), together with 
phosphate-buffered saline (PBS),- human IgGl.' Fas-Fc, or OcR3-Fc ( 10 ^ mr 1 ):. 
AfteM6 h, apoptosis of d&T cells was determined (mean ± s.e jti. of results from 
five donors), d; Peripheral blood natural killer cefls were Incubated with. 5, Cr- 
labeUed Jurkat cells in the presence of DcR3-Fc (fined circles). Fas-Fc (open 
circles)- or human lgGl (triartgJesjL arid target-cell death was determined by 
release of B, Cr (mean ± s.d for two donors, each In triplicate). 



Figure 4GenomiC amplification of DcR3 in tumours, a. Lung cancers, compnslng 
eight adencwalrc^ 

m. n. o. p. ql one non-smam;ell carcinoma (b), one smaOrceil carcinoma (I), and 
one bronchial adenocarcinoma fl};.The data are means ± a.d. of 2 experiments 
done in duplicate. b> Colon tumour8..compris^g 17 adenocarcinomas. Data are 
means.* s.e.m. of five- experiments done in duplicate; */fl situ hybridization 
analysis of DcR3 mRNA expression in a squamous-cefl carcinoma of the jung. A 
representative- bright-field Image (left)and the (xrrespondlng.dark-fieidlirnage 

• (right) show DcR3 mRNA oyer mnltrating maligriant epftheflum (arrowheads) 
Adjacent non-mangnam stoma (S), blood ve$sel-(V) and necrotte.tumotjr tissue 
(N) are also shown, d; Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regforis (reverse and'forward, Rev and>wd), the 
DcR3-llnked marker TifiO, and other chro'mosome-20 markers, in the nine colon 
tumours showing DcR3 ampliricatfon of twofold or more (b). Data are hom two' 
experiments -done In dupflcata Asterlsk hdicatesJP<0:D.i for a Students Mest 

. comparing each marker with DcR3. ' 
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fasL binding; hence; it may represent a third mechanism, of 
extracellular regulation of FasL activity: A decoy receptor, that 
- modulates the . function of the- cytokine interieukin-1 has - been 
described? 1 * In addition, two decoy receptors that belong to the 
TNFR family, Dcftl and DcR2> regulate the FasLrelated apoptbsis- 
inducing molecule ApoaL 22 . Unlike DcRr and DcJU, which are 

• membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TOFR-famiry member is 
OPG 3 , which shares, greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a. third 

: dccoy.for Apo2L". Thusj DcR3 and OPG define a. new subset of 

* TNFR-£un3y members that function as secreted decoys to mod- 
. ulate ligands that induce, apoptosis. Pox viruses produce soluble. 

TNFR homologues that neutralize 'specific TNFfamily- ligands, 
thereby modulating the antiviral immune response 7 . Our results, 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion- by 



certain tumours.. 



Methods 

; teolatfoh of DcR3 cDNA. Several overlapping EST* in Gen Bank (accession 
numbers AA025672, AA025673 an* W67560) and - in Ufeseq? 1 (Incyte 
Phannaceudcals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members, of me TNFR family. We 
screened human cDNA libraries by PGR with primers based qu the region of 
EST consensus; fetal lung was positive for a 'product of the. expected size. By 
hybridkadon ta a PCR-gcnerated probe based on the ESTs, one positive done 
(DNA30942)f was. identified. When searcrung for potenti^ 
-forms of DcR3 that might encode a transmembrane protein,. we isolated 50" 
more clones; the coding regions of these clones wexe identical in size to that of 

* the'imtial done J (data not shown). \f\\. 
Refusion proteins (immunoadh astro);' The enure DcR3 sequence, or the 
ectodomain of Fas or TNFRl , was, fused to the hingc:and Ft region oHiiiman 
IgGl, expressed in ins«*$F9 cells or m.human^293'. cells, and -purified, as 

'described". . £ 

Ruorescenee-activat^d celi sorting (PACS) analysis. We transfected 293 
cells using calcium phosphate orEffectene f Qiagen) wth pRKS-vector or pRKS 
GHX)din&ruI14engt}i human Fast' (2 p.g), together with pRK5 encoding CnnA 
(2 fig)' to- prevent ceD death. After I6h, the cefls were incubated 4 with 
biotinyiated DcR^Fc or piTRl-Fc and then with phycoerythrin-conjugated 

* streptavidin (GibcoBRD'r and were assayed by EACS. The data were analysed by 
Kobrwgoro^Siiuraovstatkticai analysis. There was some detectable staining 
of vector^transfepted cells by DcR3-Fa as these cells express little Fast (data * 
not shownfcit is possible that DcR3 mxtgnized some other factor tKat.is* 
expressed constimtrvdy on 293 cells, " 

(mmunoprectpftatiori. Human 293 cells were transfectcd as above,, and - 
metarxilicauy labelled with [ 35 S] cysteine and [^methionine. (05 mCU. 
Ameisham). After Wh.of culture in the presence of z-VAD-fink { 10 ul4}, 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fcor TNFRl -Fc 
■ (5>g), followed by protein A-Sepharose. (Rcphgen). The precipitates were 
"resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). . 

* Alternatively, purified. Flag-tagged soluble FasL (1 jig) (Alexis)! was incubated 
with each Fc-fiision protein (1 jig); precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and- visualized by immunoblortiog with' rabbit anbV 
FasL antibody (Oncogene Research).. 

Analysis of complex formation. Flag-tagged soluble FasL (25 pig) was 
' mcubatedwith buffer or with DcR3-Fc (40 u,g) for LSh at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmada) and devdoped 
with EB$> 0.6-mi fractions were collected. The presence of DcR3-Fc-FasL 
complex in each traction was analysed oy placing 100 jil aiiquots into miootitre 
wells precoated with anti-human lgG (Boehringer); to capture DcR^-Pc, 
faDowedbydetection.Mmbibriir^lated (Kodak) and 

streptavidin^horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent reWeimolecular massofthe-complex of 420K (data not 
shown), which is consistent with a stoichiometxy of two DcR3-Fc homo aimers 
to two soluble FasL homotrirners. 

Equilibrium binding analysis. Microti tre wells were coated .with anti-human 



n 



IgG. blocked with 2% BSAin PBS; DcR3-Fc or Fas-Fc was added, foBowecl by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Hag antibody as above. In. the competition assay, Fas-Fc was mimobilized as. 
above, and the wefls were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus i>cR3-.Fc. . - . 
T-ceU AICD. CD3* lymphocytes were isolated from peripheral blood of 
^dividual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 ugmT^ for 24 h, and cultured 
in the presence of interleukin-2 (lOOUmT 1 )^ 5 daysilne cefls were plated in • 
wells coated with anti-CD3 anuWy (Pruinrungen) and analysed for apoptosfe 
16 b later by FAGS analysis of armexm-V^buiding of CD4* ceUs". 
Natural killer cell activity. Natural killer cells were Isolated from pcripheraj 
blood of individual donors using anti-QD56 magnetic beads (Mfltenyj 
.Biotech), and incubated for 16 h with s, 0-loaded Jurkat ceils at an eflfector- 
to-target ratio of 1:1. in the presence of DcR3-Fc, Fas^Fc or. human IgGl: 
Target-cell death was determined by release oJE ^Cr in effector-target co- 
culcures relative to:release of 5, Cr by detergent lysis of equal numbers of Jurkat 
cells. . 

Gone-amplrricatl.on analysis. Surgical specimens were provided by J; Kern 
(lung tumours) and P. Qjiirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the. concentration - was determined using Hoechst dye- 33258 
intercalation fluorometry. Amplification was deiennined by quantitativePCR'* 
using a TaqMan mstrument (ABI). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER-2 oncogenes (data 
not shown). Gene^spedfic primers and. fluorogenic probes were designee! on 
the basis of the sequence of DcR3 or of nearby regions ujentirled on a BAC 
<arrying the human DcR3 gene; alternatively, primers and probes were based 
- oh Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcRJ (likelihood score = 5.4). SHGC-3626a (Ti59);'the nearest 
avaflable marker which maps to -500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3-specific primer sequences were 
5 f -CrTCTTCGCGCACGCTC.-3 ' and 5' -ATCAGGCCGGCACCAG-3' and the 
fluorogemc probe sequence was 5'-(FAM-AGACGATGCGTGCTCCAAGCAG 
AAp- (TAMARA)» where f AM. is S'-fiuores^ phospKoraraidite. Relative 
gene- copy numbers were derived -using the formula 2 CACT * where ACT is the 
difference in amplification cycles required' to. detect DcR3 in peripheralblood 
lymphocyte DNA compared to test DNA. 
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Crystal structure of the 
ATP-binding subunit 
of an ABC transporter 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across, membranes of both prokaxyotes and 
eukaryotes 1 . Hie recently completed Escherichia coU genome 
sequence revealed that the largest family of paralogoii^nVWi 
proteins is composed of ABC transporters 2 . Many VuTwryotic 
proteins of medical significance belong to tb4s=famUy/ such as 
the cystic fibrosis transmembrane a>nductanceregulator (CFTR), 
the >gycoproteih for muldo^-resisWce.'piotein) and the 
heterc&meric transporter, associated^witii. antigen processing 
(Tapi^Xap2); Here we report the crys^structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the rustidihe permease, 
which, is an ABC transporter fron* Salmonella typhimuriutru We 
correlate the details of thfes^cturewim me biochemical; genetic 
and biophysical properties of the wild-type and several mutant. 
HisP proteins. The, structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

•ABC transporters contain four structural domains: -two nudeo- 
We^mam^omains (NBDs), which are highly, conserved 
throughout the family, and two transmembrane" domains'. In 
•^rokarybtes these domains are often separate subunits. which are 
assembled into a membranes-bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain, The 
periplasm* histidine permease of & typhimuriutn and £ cotf' 3 * is ai 
weD-chara<*erizcd ABC transporter mat is a good model for. this 
superfarnily. It consists of a membrane-bound complex, HisQMP 2 » 
which comprises integral membrane subunitSr HisQ and HisM, and 
two copies of HisP,. the ATP-binding subunit HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins', is accessible from'both sides.of the membrane, 
presumably by its interaction with HisQ and HisM 6 ; The two HisP* 
subunits form a dimer,. a* shown by their, cooperatrvity in ATP 
hydrolysis 5 ; me requirement for both subunits to be present for 
: activity*, and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 : HisP has been purified 
and characterized in an active soluble form 5 which can be recon- 
stituted into a fuBy active membrane-bound complex 1 . 
- ; The overall shape of the crystal structure of the HisP.monomer is 
that of an V with two thick arms {arm I and arm. II); the ATP- 
binding pocket is near the end of arm I (Fig. I). A six-stranded 15- 
sheet (03 and 08-012) spans both arms of the L, with a domain of a 
ct- plus 0-type structure (01, 02, 04-07, a! and a2) on one side 
(within arm I). and a domain of mostly a-helices (ct3-ra9) on the 



ARM II 




ARM I 




.Figure r Crystal structure of HlsR a. View of the dlmac. along an- axis 
rjerpendicular to its tworfold axis..The top and bottom of the dimer are suggested: 
to face towards the perlplasmic and cytoplasmic sides, respectively {see text). 
The thickness of arm II is about 25 A! comparable to that of membrane. a-HeJices 
ere shown In orange and a-sheets In green, b; View along the two-fold axis of the 

. HisP dimer. showing the relative displacement of the monomers not apparent In 
a.The p-strands at the dimer interface are labelled. * View of one monomer from 
thtrbottom of ecm I. as shown h a, towards arm II. showing.tho ATP-binding * 

. pocket a-*, The protein and" the bound. ATP are in 'ribbon* and natortfsticV 
representations, respectively. Key residues discussed in the text are Indicated In 
c These, figures were prepared with MOLSCRIPHF. N, amino terminus C, C 
terminus. ■' 
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Inl X Cancer: 78, 661-666.(1998) - . ' "'■'"'figfe* ^""^^Intema^ 
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NOVEL APPROACH TO QUANTITATIVE POLYMERASE CHAIN REACTION USING " 
REAL-TIME DETECTION: APPLICATION TO THE DETECTION OF GENE 
AMPLEFICAHON IN BREAST CANCER 

Ivan Bieche 1 - 2 , Martihe Olm 1 , Marie-Helehe ChampemE 2 , Dominique Vidaui>\ Rosette Ltdereai^ and Michel Yidaud 1 * • ■ 
l Laboratoire de Ginetiqve Moliculaire, Facuiie des Sciences Pharmaceutiques et Biologiques de Paris, Paris, France 
2 Laboratoired'Oncogenetique; Centre Rene Huptenin, ShCloud, France ■ . 

- Gene amplification is a common event in the progression of surrounding the.link suggested by Slamon et al i\9%7\ between 

merit (ABI Prism- 7700 Sequence Detection System) capable mvo!ved - . . . 

of measuring fluorescence In real-time, was used to quantify Specific genes (DNA) were initially quantified in tumor cells by 

gene amplification In tumor DNA. Reactions are character- means of blotting procedures such as Southern and slot blottine. 

ixedby the point during cycling when PCR amplification is still These batch techniques require large amounts of DNA fS^lO 

.in the exponential phase, rather than the amount of PCR. uetectionVto viell kIImT^^Z^ZZ^ iK*L£^ 

product accumulated after a fixed number of cycles. None of EriK^T!^ ^ Fur * ennore ' 

the reaction components Is limited during the exponential ™ *™ r f* CS r°« lh 5 pTOCcdurcS **' 

phase, meaning that values are highly reproducible m reac- f TOCR 7 T e b i Q Ji °J ^cient quality for reliable dosage analysis, 

tions starting with the same copy number. This greatly Kecent v» P V R . 35 pr 0 ? 63 * to be a powerful' tool for quantitative 

improves, the precision of DNA quantification. Moreover, DNA analysis, especially with niimnial stardng o^ianndes of tumor 

real-time PCR does not require post-PCR. sample handling, samples (small, early-stage tumors- and formalin-fixed, paraffin- 

thereby preventing potential. PCR-product carry-over con- embedded tissues): 

The real-time PCR method, was used to develop and validate E-**^ aft ^ a 8™* number of cycles (end-point quantita- 

a simple and rapid assay for the detection and quantification ttye °F aft f . a .y*W*& ™nrf*r of cycles during the 

of the 3 most, frequently; amplified genes (myc. ccnoM and exponential phase (kineUc quantitative PCR); In the first case, an 

- erfeB2) m. breast tumors* Extra copies of myc,.ccndT and erbB2 • internal standard distinct from the target molecule is required, to 

were observed' in 10, 23 and .15%, respectively, of 108 breast- ascertain PCR efficiency. The. method is relatively easy but implies 

tumor DNA; the largest observed numbers of gene copies generating, quantifying and storing an internal standard for each 

were 4.6, 18.6 and 15 % respectively. These results correlated gC ne studied. Nevertheless, it is the' most frequently, applied 

well with those of Southern blotting. The use of this new method to date rajuemiy appuea^ 

. semi-automated technique will make molecular analysis of.. . - A ' , . 

human cancers simpler and more reliable; and should find 0116 or the majoradvantages of the kinetic method is its rapidity , 

broad applications .in clinical and research settings, int. J. m quantifying a new gene, since no internal standard is required (an 

Cancer 78:661-666, 1998L . . " ~ external standard curve is sufficient): Moreover, the kinetic method 

Q 1998 miey-Liss; Inc. . .has a wide dymunic range (at least 5 orders of magnitude), giving 

an accurate value for samples differing in their copy number. 

Gene amplification piays an important role in the pathogenesis Uirfortunately, the method is cumbersome and has therefore been 

: of various solid tumors,. including breast cancer, probablybecause u f ed ' lt mvoJves aliquot sampling of each assay- mix at. 

over-expression of the amplified target genes confers a selective. regula^tervaJs and quantifying, for .each aliquot, the ampiifica- 

advantage. The first-technique used to detect genomic amplification * on P*?* 1 ^ Utowf in the kinetic method has been stimulated by a 

was cytogenetic analysis. Amplification of several chromosome ? oveI \ appr °!™ ^g fluorescentTaqMan methodology and a new 

regions, visualized either as extrachromosomal double minutes ' u ^ trumen f CABl Prism7700 Sequence Detection System) capable 

(drains), or as integrated homogeneously staining regions (HSRs), °J ^^X^^ 1 ^™* inrcaIthne (<?*bson et aL, 1996; Hexd et 

are among the main visible cytogenetic abnormalities in breast ? l^o). The TaqMan reaction is based on the 5' nuclease assay 

tumors: Other techniques such as comparative genomic, hybridiza- • ™*\ dcscrio ?o by Holland et at (1991); The latter, uses the 5' 

tion (CGH) (Kallioniemi eroL. 1 994) have also been used in broad nuclease acUvity of Taq polymerase tocieave a specific fluorogenic 

searches for regions of increased DNA copy numbers in tumor 0 "g° nu cleotide probe during the extension phase of PCR. The. 

cells, and have revealed some 20 amplified chromosome regions in a PP roa f]J uses dual-labeled. fluorogenic hybridization probes (Lee 

breast tumors. Positional cloning efforts are underway to identify & ff/ '» l993 '' ° n *flu<>rescent dye, co-valently linked to the 5' end 

the critical: gerte(s) in each amplified region. To date, genes known ° f oHwpwctorid* serves as a.reporterIFAM (t.e., 6-carbbxy- 

to be amplified frequently in breast cancers include myc (8q24) fluorescein)] and its emission spectrum is quenched by a second 

ccnd\ (M.ql3), and eroB2(l7ql2-q21) (for review, see Bieche and . ^o^cem dye, TAMRA (Le. t c^»rboxy-tetiamethyl-modamine) 

Lidereau, 1995): ■ attached to the 3 end. During the extension phase of the PCR 

Amplification of the myc, ccndl, and er£B2 proto-oncogenes 
should have clinical relevance in breast cancer, since independent 



studies have shown that .these alterations can be used to identify Grant sponsors:- Association Pour' la Recherche air le Cancer and 

sub-populations with a worse prognosis (Beros et al„. 1992; Mmisterc de rEnseignement Sun^ 

Schuuring et al, 1992; Slamon et all. 1987). Muss et al (1994) - 

suggested .that these gene alterations may also be useful for the ♦Corn^ndencc t^Uboratoirs dc Genetique MolecuW Faculte des 

pre<hchonandassessmentofuieefficacyofadjuvantchemot Sciences Pharmaceutiques et Biologlqucade P^Cuc6 

and hormone therapy. ' # • rObservatoire. F-75006 Paris, France. Fax: (33)1^407-1754. 

• However, published results diverge both in terms' of the fre- B«ihiuvHiud8iMiB* 

quency of these alterations and their clinical value. For instance, _! 

over 500 studies in 10 years have failed to resolve the controversy. \ Received2May 1998;' Revised 30 June 1998V 
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cycle, the; fluorescent hybridization probe is hydrolyzed by the of extensive degradation} are bdtb difficult to assess. We therefore- 
5 -3 nudeolybcj^ atsaquantified a control gene (alb) mappmgtochrortwsomereeion 
.turn of the probe releases the quencbmg of FAM fluorescence 4qM^13, m which no genetic alterSo^ havc^™™£ irr" 
emission,. resuihng in an increase in peak fluorescence emission, breast-tumor DNA by means of c™Soni?mt J^T iXa : 
The fluorescence signal is normalized by dividing the emission f COT (Kalhoniemief a/... 1994). 
intensity of the. reporter dye (FAM) by the emission intensity of a ^ .? °1 fte number of the target gene to the copy 
reference dye (i.e., ROX, 6^carboxy-X-rhodamine) included in • aumbe f of ^ S<me normalizes the amount and quaUty of 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized Senomic DNA. The ratio defining the level of amplification is . 
reporter), for a given reaction tube. The use of a sequence detector tcrmcd " N>? > ^ is determined as follows: 
enables, the fluorescence, spectra of all 96 wells of the thermal . ^.^..f * , Jt ■ 
cycler to be measured c^^ n - PY gcne (op * ^ ^ ^ B2 > 
. The real-time PCR method offers severajradvantages over other copy number "of reference gene {alb) 
current quantitative PCR methods. (Celr. ear aL, 1994): 0J the- P rt w~ ™w L> t , 
probe-based homogeneous, assay provides a real-time method for J^IL!^^ P"* ™* PC * 
detecting only specific amplification products, since specific hybri- ??S "SS^S™^ 7?™°™ ,"2? 
dation of both the primers and the probe is necessary to generate a. ° f .1 2T2^?E2L U f ° '° ^ NaUo ^ 1 Biosciences, Ply- • 
signal; fit) the Q (threshold cycle) value used for quant&cation is ^J^T! Systems, Cincrrrnati, OH) and Primer- 
measured when PCR amplification is still in the log phase of PCR tx P ress (Fedon-EImcr Applied Biosysteras, Foster City/CA).: . 
product accumulation. This is the main reason why Q is a more .. Fnmers were purchased from DNAgency (Malvern; PA) and 

. reliable measure of the starting copy number than are end-point - probes from Perkin-Elmer Applied Biosystems: 

measurements,.^ which a slight difference in a limiting component Nucleotide sequences for the oligonucleotide hybridization 

; .canhaveao^nc-effetf ^ probes and primers are available on request 

values gives a wider dynamic range (at- least 5 orders. of magni- Th e TaoMan prR r ^ rf >. m<Mt 71 w. ■ ;. 

hide), reducing the need for serial dilution;-^ The real-timeTCR m TSiSSSlIS^ T^S^J^T^ ^ 

method is run in a closed-tube system and requires no post-PCR Micro Amp caps were from Perkin-Elmer Applied Biosystems. 

. sample handling,, thus avoiding potential contamination; (y) the- Standard-curve construction: The kinetic, method, requires' a ' 

system is highly automated, since the instrument continuously* standard curve. The latter was constructed with serial dilutions of 

measures fluorescence in ail 96 wells of the mermal cycler during specific PCR. products, according, to Piatak et a/. (1993). In 

■" PCR amplification and the corresponding software processes, and practice, each specific PGR product was obtained by amplifying 20 

analyzes the fluorescence data; fvi) the assay is rapid, as results are n & °f* 8 standard human genomic DNA (Boehringer, Mannheim, 

avaiJablejust one minute after thermal cycling is complete; (vii) the Germany) with the same primer pairs as those used later, for 

sample throughput of the method is high, since 96 reactions can be real-thne quantitative PGR. The 5 PCR products were purified 

analyzed in 2 hr, , using MicrpSpin S-400 HR corumiis (Pharrnacia, Uppsala, Swe- 

Here, we applied this semi-automated procedure to determine : den> electrophbrezed through an acrylamide geT and stained with 

the copy numbers of the 3 raost fequentiy amplified genes in breast ethidium .bromide to check their quality. The PCR products were 

tumors (myc ccndl and erbB2\ zs well as 2 genes (alb and app), tiien quantified sr«ctr6photoraetricaliy and pooled, and serially 

located in a chromosome region in which no. genetic changes have diluted 1 0-fold in mouse genomic DNA{Clontech, Palo Alto, CA> 

been observed in breast tumors. The results for I08.breast tumors at a constant concentration of 2 ng/ul. The standard curve used for 

were compared with previous Southern-blot data for the same real-time quantitative PCR was based on serial dilutions of the pool 

sam P ,cs ' of *CR products ranging from 10?*' (UP copies of each gene) to 

10^ 10 (10 2 copies). This series of diluted PCR products Was 

- aliquoted and stored at -80°C until use. ■ 

MATERIAL AND METHODS I. " , . T, * ' 

Tumor and blood samples " m stan^ curve was vahdated by analyzing. 2 taiown. 

uinurunuuiooa samples. .. . quantiUes of calibrator human genomic DNA(20ngandSOng); 

Samples wererjb^^ PCR amplification. Amplification mixes (50 idV contained the 

singly from panents at the Centre Rcnefluguenin; none of the D NA (aro^d.20 n& a^SfiSSS ea of SenS 

patients had undergone radiotherapy or chemotherapy. Immedi- lOX TaoMan buffer f5 ah Ton n^AA^J^Ar^ A^ 

afcly after surgery, the tumor samples were placed in liquid ^dSB^fi£a ^^rSlr^^V^J 

nitrogen until extraction of high-molecular-wcight DNA. Patients \1 ¥1* I -i Si, * , ^™ m ^ 1 { ra ? Goldt ^ ^ of 

were included m.this study if the. tumor ^^fc?TOA fSJ?^{^ 

• preparation contained more than 60%. of tumor cells (histological wr^^n ^q^^^ "J g ^*°^ ?°™P^ * *™ & 

analysis): A blood sample was also taken from 18 of the £me ^Cand I0n^at9^C.^ 

patients. 95°C for 15 s and 65°C for I mm. Each assay included: a standard 

. DNA was. extracted from tumor tissue and blood leukocytes- 2™ SSm£ SmS^ " ***** ^^^^ 

according to standard methods. . 20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 

triplicate; and about 20 ng of unknown .genomic DNA in triplicate 

Real-time PCR ® 6 sam P les 03,1 ^ te analyzed on a 96-weil micropJate). All 

n , rtw ,} . ; ^ *' ' . t samples with a coefficient of variation (CV> higher than iO%were 

Theoretical basts.. Reactions are characterized by the point retested. • , . 

.during cycling when amplification of the PCR product is first All ._,. „. * ' j . • ' . ' ■•' 

detected, rather than by the amount of PCR product accumulated • Ml ^tions were performed in the ABI Pnsm T700 .Sequence 

after a fixed number of cycles. The higher the starting copy number ^ etectl °" System (Perkin-Elmer Applied Biosystems), which 

of the genomic DNA target, the earlier a significant increase in detects ^ 60111 fluorogenic probe during PCR. 

fluorescence is observed The parameter C» (threshold cycle), is Equipment /or real-time detection. The 7700 system has a 

defined as the fractional cycle, number at which the fluorescence built-in thermal cycler and a laser directed via fiber optical cables 

generated by cleavage of the probepasses a fixed threshold above to each jof the 96 sample wells, A charge-coupled-device (CDD) 

baseline. The target gene copy number in unknown samples is camera collects the emission from, each sample and the data are 

quantified by measuring Q and by using a standard curve to analyzed automatically. The software accompanying the 7700 

determine the starting copy number. The- precise amount of systemcaiculatesCt and determines the starting copy number inthe 

. genomic DNA (based on optical density) and its quality {Le., lack samples. 
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'Determination.^ of gene amplification. Gene - amplification was 
calculated as described above. Only samples with an N value 
higher than2 were considered to be amplified. . : 

RESULTS. * 

. To validate, the method, real-time PCR was .. performed on 
genomic DNTA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients? 
The target genes- were the myc, ccndJ and erb&2 protcroncogenes, 
and* the p -amyloid precursor protein gene (app) t which maps to a 
chromosome region (21q2!\2) in which no genetic alterations have 
been found in breast tumors (JCallioniemi et al r 1994), The 
reference- disomic gene was the albumin gene {alb. chromosome 
4qll-ql3): 



Validation of the standard curve and dynamic range 
of real-time PCR . 

The standard curve was constructed from PCR produce serially' 
diluled in genomic mouse BNA.at a constant concentration of 
2 ng/ul. ft should be noted that the 5. primer pairs chosen to analyze 
the 5 target genes, do not amplify genomic mouse DNA (data not- 
shown). Figure 1 shows the* real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders- of 
magnitude), with, samples containing as few as f.0 2 copies or as 
many as 10* copies. 

Copy-number, ratio of the 2 reference genes Yapp and ' alb) 

The app to alb copy-number ratio was determined in 18 normal 
leukocyte DNA^amples and all 108 primary breast-tumor DNA 
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ii^/ii^JSi^H ¥?£ , do ??§ c ;^x rea, : tlme PCRL J 0 * AmpUfication plots for reactions with starting alb genecopy number ranging 
tan W (A9), 10* (AT), 10* (A4) to I<F (A2) and.a no-template control (A 1). Cycle number is plotted chaiTge in normal&d leportcrshrf' 
Mtok to „K^„T«^ 5 C ff ^ rcSCcn «. S, ^ a! of *■ ^erdy* ?AM) is divided by the fluorescence si£ial of me pasrive mference^e 
S? i ?2r? a - «™ c ^ aj^nonnalacd ^ reporter signal (Rn). ARn represents the normalized reporter signal (Rn) minus the baseline 
n£f^ b ^ "Si* 6 ?? 15 PCR i^«ses during PCR as */* PCR product copy numberhcrease^til *e3oTreS« a 

PiLiT- S 7.5S e s5 epftse ?? s ** f p lCtl0,la, cvcIc number at which a significant increase in Rn above a baseline signal (horizontal black 

SfiiS first be Elected Two replicate plots were performed for each standard sample, but the data for only one are shown here. Bottom- 
£^r^ P ^. to i' a ? 1 ^ nUmbCr JW"** ^ic> The black dots represent the data for standard sairmles plotted 
duplicate and tbe red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders^^Kic 
range* . * 



samples. We selected these 2 genes, because they ace located h> 2 reagents and requires, reladvelv W fl m«„ n t« of Wnkn,,,^ 

. the 18" normal- leukocyte DNA samples feti \^nM™ 5 oZt£ of LllKSf h 8 ™* 

(mean L02.± 0.21), and was similar for the 108 primary breast- Sm^ cytopuncture 

tumor DNA samples. (0.6 to 1.6,.mean L06 ± 0.25X cSmiXg • ™ * paraffin^mbedded Ussues), 

> that alB and app are appropriate reference disomic genes for ° 2 f"*^ we validated a PCR method developed for the 

breast-tumor DNA. The low rangrof the ratios also confirmed that CaUon 1 °. gene ^-^presentation in tumors. The method; 

the nucleotide sequences chosen for the primers and probes were f ° ? r ^ 1 ^ mc t anal >' sls of PCR amplification, has several 

not polymorphjc.as mismatches of meir.primers or probes with the- ^ £es over o^r PCR : based quantitative assays such as 

, subject's DNA would have resulted™ differential amplification. c^penuve quanuianve PCR (Celi et aL 1994). First, the leal-time. 

■ ' " ' • n •« ' * " t ' ' ' '. ,™ method ,s performed in a closed-tube system, avoiding the 

myc, ccndl -and eibB2gene dose.in normal leukocyte DNA ' risk of contamination by amplified products. Re-amplification of 

To determine the cut-off point for gene amplrffcationin breast- carryover PCR products in subsequent experiments can also be 

cancer tissue,. 1 8 normal leukocyte DNA samples Were tested for Prevented: by using the enzyme, uracil N-glycosylase (TJNG) 

the gene dose (NX .calculated as described in ^Material and _(Longo et aL, 1990). The second advantage is the simpliciQr and 

Methods". The N value of these samples ranged from. 0.5 to 1 3 ™P ld **y of sample analysis, since no post-PCR manipulations are 

(mean 0.84 ± 0.22) for myc 0:7 to 1.6 (mean 1.06 ± 0O3) for required. Our results show that the automated method is reliable 

card! and 0.6 to 13 (mean 0.91 ± 0.19>fbr er6B2. SinceN values- w * found il possible to determine, m triplicate, the .number, of 

. for myc ccndl and er6B2 in normal leukocyte DNA consistently C0 P ies of a target gerie in more.&an.l00 tumors per day. Third the 

fell between 0.5: and 1.6, values of 2 or more were considered to. system has a linear dynamic range of at least 4 orders of magnitude 

represent gene anwlification in tumor DNA; : • • meaning that samples do not have to contain equal starting amounts 

myc,ccnd!^erb2^ . ffitn^^ 

• ■ »^ ,^ „tDi- . loimaiin-nxea, parattin-embedded tissues. Fourth, and above all, 

h™^i ™ £ a S ?% € &VS** a m ^ »°8 Penary teal-tmie PCR makes. DNA quantification much more precise. and 
m^Sn. OW P ^n« T * fc * !?" ^P^^^were reproducible, since it. is, based on C, values rather lhaSend-pohrt 
: KtefKlT T* of f rtB ?o ( i 3 ! /o » r asmement ofthe amount of accumulated PCR product Indeed 
-Kmriw 18 } "? T^f^ I * L86 f0t ' JneABI Prism 7700 Sequence Detection System enablesQ tobe 
tr'2 jli^^ 82, ^onJy 2» 4.6 for thein^ gene, calculated when PCR amplification is still in the expMentia?rjriase 
Figure 2 and Table H represent tumois in which thzccndl gene was. and when none of- the reaction cDmwmpnr, ^ Sn„ P t£ 
amplified I6-fbld (T145), 6-fold CTU^^dnon-amptifiJcTHS). wimih-run W Ste^^^Tnto ^^M 
The £gen«we re neverfouftdto.be«>^^ replicates) was a!ways,betow 5% andu?eb^^^oreckion 

lf 2 i~ d ^ in S diffeienl runs wL always ^etow To^^fsS 

^ \™ lE i eH ' B2 .^ 1 "^.l™ 5 fevo « * e fcyP 0 *^ . addition, the use of a standard curve is nol Sutely ne^^ry 

rfKSS otoinedbXe^meK 

. Comparison of gene dose determined by. real-time quantitative ' sterna! control (the design andistoragtrof internal controls and the 
PCR and Southern-blot analysis ... validation of their amplification efficiency is laborious). 

• . SouthernTblot -analysis of myc t ccndl and er6B2 amplifications ^J^Z on ty P 0 ** 11 *' 3 * disavantage of.real-time PCR, like all other 
had previously been done on the same 1 08 primary breast tumors. A - PCR " Dase " methods and solid-matrix blotting techniques (South- 
perfect- correlation between the results of real-time PCR and f" 1 blots .and dor blots) is that is cannot avoid dilution artifacts 
Southern blot was obtained for. tumors with high copy numbers * inncrent »>the extraction of pNA from tumor cells, contained in 
(N s 5). However, there were cases (I myc, 6 ccndl and 4 erhB2) heterogeneous tissue specimens. Only RSHand immunohistochem- 
m which real-rime PCR showed gene amplification whereas 030 measure alterations on a cell-by-cell basis (Pauletti >r a/.. 

Southem-bldt did not, but these were mainly cases with low extra l996; SIa mon et aL, 1989); However, FISH requires expensive 
copy numbers (Nfrom 2 to 2.9).. • equipment and trained personnel and is also toe-consuming, 

"t Moreover, FISH does, not assess gene, expression and therefore 

DISCUSSION ' detect ca^s in which the gene product is over-expressed in 

, - - the absence, of gene amplification, which will be possible in the 

The clinical applications of gene amplification assays are ^e real-time quantitative RT-PCR. Immuriohistochemisby is 
currently limited, but would certainly increase if a simple; standard- subject to considerable variations in thehands of different teams, 
aed and rapid method Were perfected. Gene amplification status . owing, to alterations of. target proteins during the procedure, the 
has been studied mainly by means of Southern blotting, but this different primary antibodies and fixation methods used' and the 
method is not sensitive enough to detect low-level ^ene amplifica- criteria used to define positive staining: 

Uon nor accurate enough to quantify the fulFrange of amplification The results of this study are in agreement with those reDorted in 
values. Southern blotting is also tim^urniqg, uses radioactive- the literature, (i) Chromosome 5^iTS» SKffiS 

(which bear alb ' znd.app.. respectively), showed no genetic alters 
m "J. . aribns in the breastveancer samples studied here, in keeninc with 

TA 8 S^a ggffl^ theresd.s ofCGH.aCaHionienS^ai, 1994). fa WelKS 

— amplifications of these 3 oncogenes were independent events, as 

reported by other teams (Bernsef afc 19% Borg etaL, 1992);.(f/i) 
The frequency and degreeof myc amplification in our breast tumor 
DNA senes were lower man those of ccndl and er6B2 amplifica- 
tion confirming the findings of Borg e7 aL (1992) and Courjal etoi 
(1997). (n/) The maxima of ccndl and erbBl over-representation , 
were. 1 8ifoid and 1 5-fold, also in keeping with earlienesults (about 
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Tumor c t Copy number Pt Copy number 

■ T118 27.3 4605 26.5 4365 

■ T133 23.2 61659 25.2 10092 
T145 22.1 125892 25.6 7762 



snl?£z c £^l: 9n * alb S«ne dosage by real-time PCRui.3 breast tumor samples: Tl 18 <EI2, C6, black squares), T133 (GU, B4, red squares! 
and T145 (A8, C8, blue squares). Given the C, of each sample, the initial copy number is inferred from thestandard curve obtained-during the same 
experiment Triplicate plots were performed for each tumor samplei butthe data for only One are shown Jiere. the results are shown in Table II. 

3 ^ f ^^ mum )(Bernse/a/.. '.1992; Bag erat, 1992; Courjal et et at, 1996). Our results also correlate well with those recently 

aL. 1997). (v) The er6B2 copy numbers obtained with real-time published by Gelmini etc/. (1997), who used the TaqMan system to 

PGR were in good agreement with data obtained with other measure eroB2 amplification in a small 'series of breast tumors- 

quantitative PCR-based assays in terms of the frequency and (n - 25), but withaniiistrument (LS-50B liuninescence spectmm- 

degree.of amplification (An et ai. 1 995; Deng et al; t 1996; Valeron eter, Pcridn-Elmer Applied Biosystems) which only allows^ end- 
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' 'For each sample, 3 replicate experiments were p^^ 1115 ^ a™* ^ mean 
and fee standard deviation (SD) was detemtmcd. the level of ccndl gene 
amplification (Nccndl/alfy is determined by dividing the average ccndl 
copy number value by the average alb copy number value. 



point measurement of fluorescence' intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
POL (yi) We found a high degree of concordance* between 
real-time quantitative PGR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(2:5-fold). The. slightly "higher frequency of gene amplification 
(especially ccndl and er&B2) observed by means of real 7 time 
quantitative PGR as compared with Southern-blot analysis maybe 
explained by the higher, sensitivity of the former method. However, 
we cannot rule out the possibility mat some tumors with a few extra 



gene copies observed' in real-time tGR had addiriorml copies of an 
arm or a whoie chromosome (trisomy, tetrasomy or polysomy) ■-" 
rather than- true gene* amplification. These 2 types of genetic 
alteration (polysomy and gene " amplification) could, be easily 
' distinguished in the future by using an additional probe located' on ■ 
the same chromosome arm, but some distance from the target gene; ■ 
It is noteworthy that high gene copy numbers have the greatest : 
prognostic; significance in breast carcinoma. (Borg et a/., -1992; 
Slamon etal. 1987). > ' 

Finally, this- technique can be applied to 'the detection of gene:, 
deletion as well; as gene amplification. Indeed, we found a - 
decreased copy, number of eroB2 (but not of the other5.proto- 
oncogenes) in several tumors; erbBl is. located in* a chxombspme 
region (T7<£t) reported to contain both deletions and amplifica- ' 
dons in breast cancer (Bieche and Lidereau, 1995). 
. rn conclusion, gene ampli fication in various cancers can be used 
as a marker pf pre-neoplasia, also for early diagnosis Of. cancer, 
. staging, prognostication and choice- of treatment Southern clotting 
is not. sufficiently sensitive; and FISH is lengthy and complex. 
Realrtime quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate, method of analyzing-.. large, numbers of 
samples in a short time. -It should find a place in routine clinical 
gene dosage.. 
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~10x). Also, some tumor lines have amplified ras 
genes. A 20-fold increase in the level of a nontrans- 
forming Ras protein is sufficient to allow the 
transformation of some cells. The effect has not 
been fully quanUlated, but it suggests the general 
conclusion that oncogenesis depends on over- 
activity of Ras protein, and is caused either by 
increasing the amount of protein or (probably more 
efficiently) by a variety of mutations that increase 
the activity of the protein. 

Transaction by DNA can be used to transform 
only certain cell types. Although transforming 
oncogenes have been isolated from both rodent and 



human cells, most targets for transformation by 
transfection with oncogenes have been rodent 
fibroblasts in culture. (In fact, the difference in the 
source of the oncogene [human] and the recipient 
cell [rodent] is an important factor in allowing 
the donor gene to be distinguished unequivocally 
from recipient DNA.) Limitations of the assay 
explain why relatively few oncogenes have 
been detected by transfection. This system has 
been most effective with ras genes, where there 
is extensive correlation between mutations that 
activate c-ras genes in transfection and the 
occurrence of tumors. 




A variety of genomic changes can activate proto- 
oncogenes, sometimes involving a change in the 
target gene itself, sometimes activating it without 
changing the protein product. In cases of insertion 
and translocation, there is evidence that the 
genomic change is the causative event; in cases of 
amplification there is a correlation with tumori- 
genesis, but no direct proof for a causative role. 

Many tumor cell lines have visible regions of 
chromosomal amplification, as shown by homo- 
geneously staining regions (see Figure 56.27) or 
double minute chromosomes (see Figure 56.28). In 
some cases, the amplified region contains a known 
oncogene or a gene related to one. In other cases, 
where amplification is not visible, the use of 
batteries of probes representing oncogenes shows 
that a particular oncogene is amplified. Examples 
of oncogenes that are amplified in various tumors 
include c-myc, c-abl y c-myb, c-erbB> andc-K-ras. 

Established cell lines are prone to amplify genes 
(it is one of several karyotypic changes to which 
they are susceptible). All the same, the presence of 
known oncogenes in the amplified regions, and the 
consistent amplification of particular oncogenes in 
many independent tumors of the same type, again 



strengthens the correlation between increased 
expression and tumor growth. Of course, it is 
possible that the gene amplification gives an 
advantage to growth of the established tumor; it is 
not necessarily an event involved in its initiation. 

Some proto-oncogenes are activated by events 
that change their expression, but which leave their 
coding sequence unaltered. The best characterized 
is c-myc, whose expression is elevated by several 
mechanisms. One common mechanism is the 
insertion of a nondefective retrovirus in the vicinity 
of the gene. 

The ability of a retrovirus to transform without 
expressing a t>o/7<? sequence was first noted during 
analysis of the bursal lymphomas caused by the 
transformation of B lymphocytes with avian virus. 
Similar events occur in the induction of T cell 
lymphomas by murine leukemia virus. In each case, 
the transforming potential of the retrovirus seems to 
lie with its LTR rather than with a coding sequence. 

In many independent tumors, the virus tins 
integrated into the cellular genome within or close 
to the c-myc gene. The gene consists of three 
exons; the first represents a long nontranslated 
leader, and the second two code for the c-Myc 
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protein. Figure 59.8 summarizes the types of in- 
sertion at this locus. The retrovirus may be inserted 
at a variety of locations relative to the c-myc gene. • 

The simplest insertions to explain are those that 
occur within the first intron. The LTR provides a 
promoter, and transcription reads through the two 
coding exons. Transcription of c-myc under viral 
control differs in two ways from its usual expres- 
sion: the level of expression is increased (because 
the LTR provides an efficient promoter); and the 
transcript lacks its usual nonlranslaled leader 
(which may usually limit expression). 

Activation of c-myc in the other two classes of 
insertions reflects different mechanisms. The 
retroviral genome may be inserted within or 
upstream of the first intron, but in reverse orienta- 
tion, so that its promoter points in the wrong 
direction. Probably the LTR provides an enhancer 
that acts on an upstream sequence that fortuitously 
resembles a promoter. The retroviral genome also 
may be inserted downstream of the c-myc gene, in 
which case transcription initiates at the usual c-myc 
promoter(s) , but is increased by the enhancer in the 
retroviral LTR. 

In all of these cases, the coding sequence of c-myc 
is unchanged, so oncogenicity is attributed to the loss 
of normal cont rol, and increased expression, of the 
gene. 

Other oncogenes that are activated in tumors by 
; the insertion of a retroviral genome include c-erbB, 
I c-myb, c-mos, c-H-ras, and c-raf. Up to 10 other cel- 
lular genes (not previously identified as oncogenes 
jrby their presence in transforming viruses) are 
| implicated as potential oncogenes by this criterion. 
% The best characterized among this latter class are 
\wnti and int2. The wntl gene codes for a protein 
|involved in early embryogenesis that is related to 
flhe wingless gene of Drosophila; int2 codes for a 
growth factor related to FGF. 
\ : - Translocation to a new chromosomal location is 
: another of the mechanisms by which oncogenes are 
Activated. Certain chromosomal translocations are 
^consistently associated with activation of onco- 
genes that lie near the breakpoints. This situation 
$vas originally discovered via a connection 
|between the loci coding immunoglobulins and the 



Figure 39.8 
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occurrence of certain tumors. Specific chromo- 
somal translocations are often associated with 
plasmacytomas in the mouse and with Burkitt 
lymphomas in man. These tumors arise from 
undifferentiated B lymphocytes. The common 
feature in both species is that an oncogene on one 
chromosome is brought into the proximity of an Ig 
locus on another chromosome. Similar events 
occur in T lymphocytes lo bring oncogenes into the 
proximity of a TcR locus. 

The basic cause of the translocation event is a 
malfunction of the system responsible for recom- 
bining V and C genes or switching between IgH 
C genes. Instead of acting on two sites within ihe Ig 
or TcR locus, the system recombines the immune 
locus with an unrelated region on a different 
chromosome. This results in a reciprocal 
translocation, which is illustrated in Figure 59.9. 

We do not know the basis for the involvement of 
the nonimmune partner, but in both man and 
mouse it is often the c~myc locus. In man, the 
translocations in B cell tumors usually involve chro- 
mosome 8, which carries c-myc, and chromosome 
14, which carries the IgH locus; -10% involve chro- 
mosome 8 and either chromosome 2 (k locus) 
or chromosome 22 (lambda locus). The trans- 
locations in T cell tumors often involve 
chromosome 8, and either chromosome 14 (which 
has the TcRa locus at the other end from the Ig locus) 
or chromosome 7 (which carries TcR p locus). 
Analogous translocations occur in the mouse. 

Translocations at the IgH locus in B cells fall into 
two classes. One type is similar to those observed 
at other Ig loci and at TcR loci, involving the 
consensus sequences used for V-D-J somatic 
recombination of active Ig genes. In the other type, 
the translocation occurs at a switching site, so these 
cases may be associated with function of the 
system thai switches expression from one C n gene 
to another. 

When c-myc is translocated to the Ig locus, its 
level of expression is usually increased. The 
increase varies considerably among individual 
tumors, generally being in the range from 2 lo 10. 
Why does translocation activate the c-myc gene? 
The translocation event does not involve fixed sites, 




A chrompsom^ translocation is a reciprocal 
eventthk^xch^g of twp chromosomes. 
Translocatfons that activate the human c-/nyc 
proto-qnoogene involve Ig loci in B cells and TcR 
loci In T cells- ; 
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but occurs at a variety of locations within a general 
region on each recomhining chromosome. The 
event has two consequences: c-myc is brought into 
a new region, one in which an Ig or TcR gene was 
actively expressed; and the structure of the c-myc 
gene may itselfhe changed (but usually not involv- 
ing the coding regions). It seems likely that several 
different mechanisms can activate the c-myc gene 
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in ils new location (just as retroviral insertions 
activate c-myc in a variety of ways). 

The correlation between the tumorigenic 
phenotype and the aclivalion of c-myc by either 
insertion or translocation suggests that continued 
high expression of the c-Myc protein is oncogenic. 
Expression of c-myc must be switched off to enable 
immature lymphocytes to differentiate into mature 
B and T cells; failure to turn off c-myc maintains the 
cells in the undifferentiated (dividing) slate. The 
oncogenic potential of c-myc has been demon- 
strated directly by the creation of transgenic mice 
carrying a normal c-myc gene linked to an 
enhancer. Transgenic mice carrying a c-myc gene 
linked to a B lymphocyte-spe cific enhancer (the IgH 
enhancer) develop lymphomas. The tumors repre- 
sent both immature and mature B lymphocytes, 
suggesting that over-expression of c-myc is tumori- 
genic throughout the B cell lineage. Transgenic 
mice carrying a c-myc gene under the control of the 
LTR from a mouse mammary tumor virus, 
however, develop a variety of cancers, including 
mammary carcinomas. This suggests that 
increased or continued expression of c-myc trans- 
forms the type of cell in which it occurs into a 
corresponding tumor. Specificity of the tumor type 
may therefore depend on the mechanism used 
to activate c-myc; it is not an intrinsic property 
of the gene. 

c-myc exhibits three means of oncogene 
activation: retroviral insertion, chromosomal 
translocation, and gene amplification. The com- 
mon thread among them is increased expression of 
the oncogene rather than a qualitative change in its 
coding function, although in at least some cases 
the transcript has lost the usual (and possibly 
regulatory) nontranslated leader, c-myc provides 
the paradigm for oncogenes that may be effectively 
activated by increased (or possibly altered) 
expression. 

Every translocation generates reciprocal prod- 
ucts; sometimes a known oncogene is activated in 
one of the products, but in other cases it is not 
evident which of the reciprocal products has 
responsibility for oncogenicity. Also, it is not 
axiomatic that the gene(s) at the breakpoint have 



responsibility; for example, the translocation could 
provide an enhancer that activates another gene 
nearby. 

A variety of translocations found in B and T cells 
have identified new oncogenes. In some cases, the 
translocation generates a hybrid gene, in which an 
active transcription unit is broken by the transloca- 
tion. This has the result that the exons of one gene 
may be connected to another. In such cases, there 
are two potential causes of oncogenicity; the proto- 
oncogene part of the protein may be activated in 
some way that is independent of the other part, for 
example, because it is over-expressed under ils 
new management (a situation directly comparable 
Willi the example of c-myc); or the other partner in 
the hybrid gene may have some positive effect that 
generates a gain-of-function in the part of the 
protein coded by the proto-oncogene. 

One of the best characterized cases in which a 
translocation creates a hybrid oncogene is provided 
by the Philadelphia (PH X ) chromosome present 
in patients with chronic myelogenous leukemia 
(CML). This reciprocal translocation is too small to 
be visible in the karyotype, but links a 5,000 kb 
region from the end of chromosome 9 carrying c-abl 
to the fccrregion of chromosome 22. The bcr (break- 
point cluster region) was originally named to 
describe a region of -5.8 kb within which break- 
points occur on chromosome 22. Different cases of 
CML have breakpoints at different locations within 
this region. 

The consequences of this translocation are sum- 
marized in Figure 59.10. The bcr region lies within 
a large (>90 kb) gene, which is now known as the 
bcr gene. The breakpoints in CML usually occur 
within one of two introns in the middle of the gene. 
The same gene is also involved in translocations 
that generate another disease, ALL (acute lym- 
phoblasticleukemia); in this case, the breakpoint in 
the bcr gene occurs in the first mlron. 

The c-abl gene is expressed by alternative 
splicing that uses either of the first two exons. The 
breakpoints in both CML and ALL occur in the 
intron that precedes the first common exon. 
Although the exact breakpoints on both chro- 
mosomes 9 and 22 vary in individual cases, the 
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common outcome is the production of a transcript 
coding for a Bcr-Abl fusion protein, in which 
N-terminal sequences derived from bcr are linked 
to c-abl sequences. In ALL, the 185,000 dalton 



fusion protein has -45,000 daltons of the Bcr protein 
linked to c-Abl; in CML the fusion protein of 210,000 
daltons has -70,000 daltons of the Bcr protein. In 
each case, the fusion protein contains -140,000 
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daltons of the usual -145,000 c-Abl protein, that is, 
it has lost just a few N-lemiinal amino acids of the 
c-Abl sequence. 

Why is the fusion protein oncogenic? It relies 
on an interaction between the N-terminal region 
provided by bcr with the c-Abl protein. The bcr gene 
has a variety of sequence motifs related to proteins 
involved in signalling pathways, but the pertinent 
one is a serine/threonine kinase activity thai is 
coded within the first exon. This aulophosphory- 
lates residues in this part of the protein, and the 
phosphorylation confers the ability to interact with 
a region of the c-Abl protein called the SH2 domain 
(we discuss the nature of SH2 domains later). This 



raises the possibility that the Bcr part of the pro- 
tein interacts with the c-Abl sequences, perhaps 
changing their conformation and activating a 
latent oncogenic potential. 

Changes at the N-lerminus are involved in the 
activation of oncogenic activity of v-abt, a tran$- 
forming version of the gene carried in a retrovirus. 
The c-abl gene codes for a tyrosine kinase activity; 
this activity is essential for transforming potential 
in oncogenic variants. Deletion (or replacement) of 
the N-terminal region activates the kinase activity 
and transforming capacity. So the N-lerminus 
provides a domain that usually regulates kinase 
activity; its loss may cause inappropriate activation. 
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The common theme in the role of oncogenes in 
tumorigenesis is that increased or altered activity of 
the gene product is oncogenic. Whether the onco- 
gene is introduced by a virus or results from a 
mutation in the genome, it is dominant over its 
allelic proto-oncogene(s). A mutation that activates 
a single allele is tumorigenic. Tumorigenesis then 
results from gain of a function. 

Certain tumors are caused by a different mech- 
anism: loss of both alleles at a locus is tumorigenic. 
Propensity to form such tumors may be inherited 
through the germline; it also occurs as the result of 
somatic change in the individual. Tumorigenesis 
then results from loss of function. Such cases iden- 
tify tumor suppressors: genes whose products are 
needed for normal cell function, and whose loss of 
function causes tumors. The two best characterized 
genes of this class code for the proteins RB and p55. 

Retinoblastoma is a human childhood disease, 
involving a tumor on the retina. It occurs both as a 
heritable trait and sporadically (by somatic muta- 
tion). It is often associated with deletions of band 
ql4 of human chromosome 15. The RB gene has 
been localized to this region by molecular cloning. 

Figure 59.11 illustrates the situation. 



Retinoblastoma arises when both copies of the RB 
gene are inactivated. In the inherited form of the 
disease, one parental chromosome carries an alter- 
ation in this region, usually a deletion. A somatic 
event in retinal cells that causes loss of the other 
copy of the RB gene causes a tumor. In the sporadic 
form of the disease, the parental chromosomes are 
normal, and both RB alleles are lost by (individual) 
somatic events. 

Almost half the cases of retinoblastoma show 
deletions at the/?/? locus. In other cases, transcripts 
of the locus are either absent or altered in length. 
The protein product is absent from retinoblastoma 
cells. The cause of the tumor is therefore loss of 
protein function, usually resulting from mutations 
that prevent gene expression (as opposed to point 
mutations that affect function of the protein prod- 
uct). Loss of RB is involved also in other forms of 
cancer, including osteosarcomas and small cell 
lung cancers. 

What is the molecular function of RB? It interacts 
with a variety of other proteins, including several 
tumor antigens: SV40 T antigen, adenovirus El A, 
human papilloma virus E7. One possibility is that 
part of the oncogenicity of these proteins is due to 
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ABSTRACT 

Regulatory or structural alterations of cellular oncogenes have been implicated in the causation of variou 
cancers. Oncogene alteration by point mutations can result in a protein product with strongly enhanced on* 
cogenic potential. Aberrant expression of cellular oncogenes may be due to tumour-specific chromosomal 
translocations that dysregulate the normal functions of a proto-oncogene. Amplification of cellular onco. 
genes can also augment their expression by increasing the amount of DNA template available for the pro' 
duction of mRNA. It appears that amplification of certain oncogenes is a common correlate of ihi- progress 
sion of some tumours and also occurs as a rare sporadic event affecting various oncogenes in different types 
of cancer. Amplified copies of oncogenes may or may not be associated with chromosomal abnormalities 
signifying DNA amplification: double minute chromosomes and homogeneously staining chromosomul 
regions. Amplified oncogenes, whether sporadic or tumour type-specific, are expressed at elevated levels, 
in some cases in cells where their diploid forms are normally silent. Increased dosage of an amplified 
oncogene may contribute to the multistep progression of at least some cancers. 
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DNA SEQUENCE AMPLIFICATION AND 
CYTOGENETIC ABNORMALITIES 
IN TUMOURS 

Since its discovery in drug-resistant eukaryotic 
cells, somatic amplification of specific genes 
has been implicated in an increasing variety 
of adaptive responses of cells to environmental 
stresses (70, 79). Cytogenetic abnormalities, 
double minute chromosomes (dminrs) asso- 
ciated with DNA amplification had already 
been discovered in tumour cells before the 
discovery of dmur.s and homogeneously stain- 
ing chromosomal regions (HSR:s) in cells se- 
lected for drug-resistance (12, 24, 49, 50 f 56). 
In metaphase spreads, dmin:s appear as small, 
spherical, usually paired chromosome — like 
structures that lack a centromere (Fig. 2). HSR:s 
stain with intermediate intensity throughout 
their length rather than with the normal pat- 
tern of alternating dark and light bands in 
both trypshvGiemsa (Fig. 3A) and quinacrine 
dihydrochloride-stained preparations. Both 
kinds of abnormalities are occasionally found 
in metaphases of freshly isolated cancer cells 
but not of normal cells (8). 

Dmin:s and HSR:s are apparently rare in 
tumour cells in vivo, although exact data are 



difficult to obtain since the abnormalities arc 
easily missed in routine cytogenetic analysis 
(8, 42). Dmin:s and HSR:s have been de- 
scribed in most types of in vitro-culturcd malig- 
nant tumour cells, with a notable frequency 
in neuroblastoma cell lines [11]. Initial growlh 
in cell culture apparently selects for tumour 
cells that contain either dmin:s or HSR:s, 
Moreover, in culture dmin:s are frequently 
lost, concomitant with the appearance of 
clonal populations of cells that have devel- 
oped an HSR, suggesting that the two cyto- 
genetic abnormalities are alternative forms of 
gene amplification and that HSR:s may confer 
a selective advantage on cells over dmin:s (11 , 
70). It has been assumed that HSR:s can break 
down to firm dmin:s and that dmin:s can in- 
tegrate into chromosomes to generate HSR:s 
(11, 23). Amplified genes may also occupy ab- 
normally banding regions, ABR:s (51, 59). Ex- 
perimental work on drug-resistant cells has 
shown that in the absence of a selection pres- 
sure (drug), dmin:s and the amplified genes 
that they contain are lost, whereas amplified 
DNA in the form of HSR:s is retained in [he 
cells (71). This is explained by the fact that 
dminrs are segregated unevenly in mitosis ana 
frequently get lost from the nucleus due to 
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Amplification of cellular oncogenes in cancer cells 

TABLE 1 

Currently known oncogenes. 
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ONCOGENES FOUND IN RETROVIRUSES 






Retrovirus 
{example} 


Oncogene 




Gene product 






Cellular location 


Function of protein 


Class 


RSV 

Y73V 

GR-FeSV 

Ab-MuLV 

FuSV 

ST-and 

GA-FeSV 

UR2V 


$rc 
yes 

fgr 
abl 
fps/fes 

fes/fps 

r/i c 
f 


Plasma membrane 
Plasma membrane 
Plasma membrane 
Plasma membrane 
Cytoplasm 
(plasma membrane?} 
Cytoplasm 
(cytoskeleton?) 


Tyrosine-specific 
protein kinases 
[fgr contains sequences 
homologous to actin} 


Class la {Cytoplasmic 
tyrosine protein kinases) 


AEV 

SM-l-eSV 

MH-2V 

39U-MSV 

Mo-MSV 


crbB 

fnis 

mil/raf 
raf/mil 
mos 


Plasma membrane 
and cytoplasmic 
membranes 
Plasma membrane 
and cytoplasmic 
membranes 
Cytoplasm 
Cytoplasm 
Cytoplasm 


EOF" TPr&nirsr' c v t n . 
but 1 C^CUIUI o v,ylU 

plasmic domain 

Cytoplasmic domain of 
a growth factor receptor? 

? 
? 
7 


Place lU fPlopc f<l -o| n » n J 

\_iass id [Class ia*reiateu 
proteins) 


SSV 


sis 


Secreted 


PDGF-like growth factor 


Class 2 (Growth factors) 


Ha-MSV 
Ki-MSV 


Ha-ras 
Ki-ras 


Plasma membrane 
Plasma membrane 


GTP-binding proteins 


Class 3 {Cytoplasmic 
GTP:ases) 


FBJ-MuSV 

OK-lOV 

AMV 


fos 

myc 

myb 


Nucleus 
Nucleus 
Nucleus 


? 

Nuclear matrix protein 

? 


Class 4 (Nuclear phospho- 
proteins) 


SKV 770 
REV 
AEV 
E26V 


ski 
rel 

erb-A 
ets 


Nucleus? 
? 

? 

? 


? 
? 
? 
? 


Unclassified 


ONCOGENES FOUND IN TUMOUR CELLS BUT NOT IN RETROVIRUSES 
Tumour cell 


Neuroblastoma 
Neuroblastoma 
Small -cell lung cancer 
Neurol/Glioblastomas 


N-ras 
N-myc 
L-myc 
neu 


Plasma membrane 

? 
? 

Plasma membrane 


GTP-binding 
? 

Growth factor receptor 


Class 3 
Class 4 
Class 4 
Class lb 



1 



their lack of centromeres, (49). HSR chromo- 
somes carry centromeres and are therefore 
divided equally at mitosis. If dmin:s and 
HSR:s contain amplified genes that encode 
growth-stimulating protein products, it would 
follow that the more stable chromosomal 
wni. the HSR, confers a greater selective 
growth advantage for cells. Although dmin:s 
and HSR:s have been described predominant- 
ly in tumour cells selected for resistance to cy- 
totoxic drugs, it is also clear that dmin:s and 
"SR:s may be present in cancer cells before 
*ny form of therapy (8). It was in this setting 
that we and others first chose to explore the 
Possible amplification of cellular oncogenes. 
l B y definition, cellular oncogenes are normally 



innocent genetic loci which can be activated 
to transforming genes in various ways). 1 



DMIN:S AND HSR:S CONTAIN AMPLIFIED 
ONCOGENES 

Table 2 summarizes the somatic amplifica- 
tions of cellular oncogenes so far reported in 

1 It is not the purpose of this review to deal with all forms 
ot DNA damage that have been found to activate cel- 
lular oncogenes. For the purpose of integrating the re- 
view into a coherent picture, however, the reader is 
given a list of known cellular oncogenes in Table 1 and 
the schematic Figure 1 illustrating the various ways in 
which the oncogenic potential of different proto-onco- 
genes can be activated. 
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Fig. 1. Activation of cellular oncogenes. The haploid complement of a proto-oncogene is schematically depicted in A, 
composed of three exons (black boxes] in a segment of DNA. The different activated forms are schematically outlined 
in B-G. The abbreviation c-onc stands for cellular oncogene, v-onc viral oncogene. DNA sequences with associated 
strong promoter/enhancer functions are striated, and an actively transcribed gene is marked with radiations. B. Acute 
transforming retroviruses have the capacity to transduce cellular oncogenes [cone] into their genome, modify them 
and reinsert their activated oncogenes (v-omc) into the genome of host animal cells as a part of their proviral forms. 
The activity of the v-o*c gene is greatly enhanced due to the associated promoter of the proviral long terminal repeat. 
Both increased dosage of the oncogene and structural mutations within its sequence may contribute to tumongenesis 
C. Slow transforming retroviruses without oncogenes replicate and reinsert their proviral copies into the host cell UNA 
during a latency period from infection to tumorigenesis. Tumor initiation through hyperplastic growth may begin, 
when the provirus integrates sufficiently close to a proto-oncogene to activate it through promoter or enchancer func- 
tions. It should be noted, however, that mutations have also been found in the oncogenes thus activated and that muta- 
tional damage to other oncogenes has been described in the resulting tumors. D. In some mouse plasmacytomas, a 
retrovirus-like DNA element {directing the synthesis of the so-called intracisternal A-type particles. IAPs) has been 
found in association with a transcriptionally activated oncogene c-mos. The IAP insertion also disrupts the a part of 
c-mos (64). E. In humans, as well as in animals, chromosome translocations may place proto-oncogenes into transcrip- 
tionally active regions of chromatin, where they may be activated. The details of this mechanism have not been worked 
out but it is believed to occur for c-myc and c-ooJ genes in Burkitt lymphomas and Philadelphia-chromosome positive 
leukemias, respectively {35, 40i: F. Increased amounts of oncogene-specific RNA and protein can also result from an 
excess of DNA template for transcription acquired through oncogene amplification. The present review concentrates 
primarily on this mechanism. G. Mutationally activated oncogenes have been found in nearly one fifth of human malig- 
nant tumours. Oncogene loci activated by somatic structural mutations are revealed by transfection experiments 
where they are introduced into genetic background of nontumourigenic cultured immortalized cells Several sucn 
transforming loci have been cloned and many of them belong to the c-ros oncogene family. It should be pointed out 
that both structural mutations and either increased expression or activation of a complementing oncogene may oe 

required to achieve a fully tumorigenic phenotype (44). 



tumour cells. Although the sampling of tu- 
mours is at present small, the finding of 
known cellular oncogenes among amplified 
DNA represented by dmin:s and HSR:s of 
cancer cells is provocative. Amplification has 
been found to affect at least five out of twenty 
known cellular oncogenes and the degree of 
gene amplification varies from five to many 
hundred-fold over the single haploid copies 
found in normal cells (see also ref. 18). The 
first amplification reported involved the c-myc 



oncogene (see Table 1) in a promyelocyte leu- 
kaemia cell line HL-60 (20, 25). The degree of 
c-myc amplification is between 8—32 fold 
both in the HL-60 cell line and in primary leu- 
kaemic cells from the patient (20, 25). Origi- 
nal clonal lines of HL-60 were later found to 
contain some dmin:s in culture but their num- 
ber was insufficient to establish any clear cor- 
relation with amplified c-myc. Such a correla- 
tion, however, was discovered for c-myc am- 
plification in a neuroendocrine cell line from 
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CHROMO 
SOMES 



DMIN:S 



'l C0LO32nnM ? * ? hron,osoraes |arrowheads| in 

> resol?J ° DM C °1°5 « rcin <>™a cells. A. The dmin:s are 

> ihU i pa .' r t d d< ? ,s amon 8 normal chromosomes in 
I S"S'^"^t 5,aW Preparation B-D. 

' Warir. «^i :, Xl! fferenUal centrifugations. B. 
I fi« rf, !■ 8 ^ alena i C - c "romosome fraction, D. Puri- 
! "«J dm.n:s ,Donna George and the author, unpublished 
data and ref. 52|. 



| J colon carcinoma. COLO 320 (5). In these 
Umt • a PP roxima tely 30-fold amplified 
amart^f W6re mapped either t0 H SR:s of 
£ 1 ch J. omoso ™ I 5 ' Fi 8 3BJ or to dmin:s 
j depending on the particular subline stud- 

! Wniar^f ' S a,reacJ y present in th e 
U , t r°, U !" C , ells u from this colon carcino- 

' CSV kdy th3t ^ had a,so be ^n 
Sit? dunn § 8 rowth of the tumour ^ vivo. 
Cn. , y * am P hf,ed copies of the c-Ki-ras on- 

6 e ne were manneH j t . ■ — 



Co ePn '7' «»»f»«cu copies ot the c-Ki-ras on- 

«mo?,« i 6 mappe . d t0 dmin:s and HSR:s of 
te„". adre n°cortical tumor Yl (741. An ex- 

wrr e11 ? h , as since revea,ed amp ««'a- 

"s that do not show up as dmin:s or HSR:s. 



-Art 



fhl'G.'wH^ h« ° 8e, l e0US [ y slainm 8 re 8 ions l HSR > ^ 
w ,? SR J marker chromosome comprise a ma- 

™LK rl 't n of *' s lon * and «hort arms. The HSR- 
marker chromosome has evolved from an X-chromosome 
(52 and unpublished data of C. C. Lin and the author?T 

I COW 3 f o£!u ampli ? ed °< the ^"oncog'ene 
Hsr?, rt ? ?? lls . were foun <» t0 °e located to dmin:s and 
The la,,er »s shown here by in situ-hybridization 
(5, 52). 



Thus for example, the c-myb oncogene is 
amplified in a characteristic marker chromo- 
S f a c ° lo e n carcinoma without evidence 
of HSR:s (ref 6, Fig. 4) and in other tumours, 
the amplified c-aW and c-myc oncogene loci 
map to abnormally banding regions (ABR:s) in 
translocated or resident chromosomal seg- 
ments, respectively (59, 76}. 

TRANSLOCATIONS AND 
REARRANGEMENTS MAY ACCOMPANY 
ONCOGENE AMPLIFICATION UM ™ WY 

The evolution and progression of the karyo- 
gPV* tum< > u r cells is complicated (see ref 
08 J. Concomitant with amplification, DNA se- 
quences acquire an increased mobility in the 
genome with extrachromosomal intermediates 
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TABLE 2 

Sporadic and tumour-specific amplification of cellular oncogenes. * 



Tumour cells 



Onco- 
gene 



Fold 
amplified 



Sporadic: 

HL60 {acute prornyelo- c-myc 20 x 
cytic leukaemia, M3} 

COLO320 (colon c-myc 30 x 
carcinoma] 

Yl (adrenocortical c-Ki-ras 50 x 
tumour) 

COLO201/205 (colon c-myb lOx 
carcinoma} 



K562 (chronic myelo- 
genous leukaemia, CML} 



c-abl 10 x 



Chromosomal 
location of 
amplified gene 

8q(ABRj 
dmin, HSR 
dmin, HSR 
marl 

mar(ABR) 



Expression 
elevated 



Remarks 



References 



A431 (epidermoid 
carcinoma) 



cerbft 15— 20 x n.d. 



ML 1—3 (acute myeloid c-myb 5— 10 x 
leukaemia, M2) 



SK BR-3 (breast 
carcinoma) 

SEWA (polyoma virus- 
induced mouse tumour) 



Lu-65 (lung giant cell 
carcinoma) 

Primary leukemic cells 
from an acute myeloid 
leukemia (M2| patient 



c-myc 10 x 
c-myc 30 x 



c-myc 8x 
c-Ki-ros 10 x 
c-myc 33 x 



n.d. 

n.d. 
n.d. 



n.d. 
n.d. 
n.d. 



Yes 
Yes 
Yes 
Yes 

Yes 

Yes 

Yes 

Yes 
Yes 



n.d. 
n.d. 
n.d. 



Tumour-specific: 

small-cell lung cancer c-myc up to 80 x n.d. 

L-myc, 
N-myc 

N-myc up to 250x dmin, HSR 



Neuroblastomas 



Yes 
Yes 



Glioblastomas 



c-er&B — 



Amplification present in 20, 25. 59 
primary leukaemic cells 
Part of the amplified 4, 5, 52 

c-myc sequences rearranged 
Levels of p 21 c Ki,a * 74 
protein elevated 

Patient treated with 4, 6,88 

5-fluorouracil prior to 
culturing of the tumour cells 
C k coamplified in the 21. 22, 41, 

marker that may be derived 54, 76 
from chromosome 22, c-abl 
protein-associated tyrosine 
kinase activated 

Amplification linked to 82 
chromosome 7 translocation 
and sequence rearrangements 
Amount of protein product, 
the EGF receptor, elevated (see 36) 
Abnormalities of chromo- 34, 61. 91 
some 6q22— 24, where 
c-myb is normally located 

43 

Cells have dmin:s depending Manfred 
on culture conditions; Schwab, 



c-myc amplification 
correlates with growth 
as a tumour 

At least some copies of 
c-Ki-ras mutated 



personal 
commu- 
nication 
80 

Unpublished 
data of the 
author and 
A. de la 
Chapelle 



Most amplifications in the 53. 69 
variant phenotype of SCLC 

N-myc also amplified in 14, 48, 72, 
primary tumours of 73,. 75 

advanced grade 

Rearrangements also found Josef 
personal communication Schlessin- 

ger, per- 
sonal com- 
munication 



t d \r not 1 d .^ rn 2 ncd / mar - marker chromosome, M2. M3 refer to the French-American-Britlish classification of 
acute myeloid leukemias. 

* At least one case of oncogene amplification in normal germ-line cells has been found (18). 



visualized as dmin:s, transpositions and trans- 
locations to other chromosomal segments, etc. 
(see 70 for references). There may not be pre- 
ferred chromosomal sites for the apparent 
reintegration of dminrs as HSR:s (75). In at 



least one case, however, an oncogene may 
have been caught amplifying in situ in its resi- 
dent chromosomal site (59). The finding of 
moderately amplified oncogenes also in chro- 
mosomal sites lacking HSR:s suggests that 
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marl 





DNA jug 



A B 



Fig, 4. Localization of amplified c*myb in COLO 201/205 
cells by in situ hybridization. Shown is a characteristic, 
large marker chromosome (marl) with G-banding (A) 
and associated c-myb autoradiographic grains (B). Note 
the absence of HSR:s. Marl has probably evolved from 
chromosome number 6, the resident site of the c-myb on- 
cogene in normal cells (34, 88, 91). (Robert Winqvist and 
the author, unpublished data}. 



[oncojgene amplification may be more com- 
mon than the structural alterations shown by 
chromosome banding and microscopy (6, 88}. 

In at least three cases reported amplification 
has been accompanied by a DNA rearrange- 
ment of the oncogene (5, 20, 82). In the colon 
carcinoma COLO 320 both damaged and nor- 
mal versions of the c-myc gene are amplified 
(5|. Although individual cell clones have not 
yet been examined, our unpublished experi- 
ments suggest that the same dmin-containing 
cells harbor and express both normal and re- 
arranged forms of c-myc. The normal version 
?f the amplified gene, however, predominates 
m COLO 320 cells containing HSRrs; the rear- 
ranged version is present only in what ap- 
pears to be a single copy {Fig. 5). In the 
cnronic myeloid leukaemia (erythroleukae- 
mia > c eil line K562 an amplified DNA segment 
consists of portions of both the c-aW onco- 
gene and the immunoglobulin C\ locus (76). 
n both cases abnormal transcripts are prod- 
c ed from the rearranged amplified oncogenes 
J* 6 and ref. 22). In K562 cells, the abnor- 
vat A ° bl onc °S ene P ro <*uct has also been acti- 
ated cjs a tyrosine protein kinase (41). It is not 
nown, however, whether structural aiter- 
jons of the genes preceded amplification or 
nether they were acquired during the process 

gene amplification. It seems likely that a 
nromosomal translocation of c-abl to the C 
gus preceded DNA amplification in the 
cells, since ail amplified copies were al- 

reminis- 

"tof the Philadelphia translocation |t(9. 22)| 
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Fig. 5. Amplification and rearrangement of c-myc in 
COLO 320 cells. 10 M g of cellular DNA was digested with 
Sst I f electrophoresed, blotted and probed with a v-myc 
Pst I fragment {ref. 2, left panelj. Fragments of 2.7 kbp 
and 1.4 kbp are seen in both normal and amplified c-m>>c 
DNA. The 3.3 kbp fragment is derived from a DNA seg- 
ment of unknown origin translocated to the 5' region of 
c-myc with a concomitant deletion of its first exon (unpub- 
lished data of Manfred Schwab and the author). HSF t 
human skin fibroblasts; DM, COLO 320 DM cells; HSR, 
COLO 320 HSR cells. Different amounts of DNA from 
COLO 320 DM cells as indicated were mixed with calf 
thymus DNA to give 24 M g of total DNA, cleaved with 
Sst I ( electrophoresed, blotted and probed with a frag- 
ment of 3' human c-myc sequences. The intensities of the 
2.7 kbp c-myc fragment in different samples were com- 
pared to assess its copy number, estimated to be about 

30 |5J. 



found in most chronic myeloid leukaemia tu- 
mours (35, 66-68). Although they have not 
been sequenced, other reported cases of am- 
plified oncogenes are apparently normal on 
basis of mapping with restriction endonu- 
cleases (see Table 2). Therefore we cannot at 
present view mutation as a necessary com- 
panion of oncogene amplification. 



THE MECHANISMS OF GENE 
AMPLIFICATION 

The mechanisms of gene amplification and 
the structure of the amplified DNA have been 
worked out mainly in experimental settings 
involving selection for drug-resistance in cell 
culture (70). Although the mechanisms are 
still incompletely known and may vary in dif- 
ferent cases, some general features have 
emerged. 

A spontaneous degree of illegitimate DNA 
replication seems to exist in normal cells so 
that various segments of DNA are replicated 
more than once during a single cell cycle (37). 
In unselective conditions this DNA is probab- 
ly lost e.g., through formation of micronucleae 
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RNA 
DM HSR 



<)2.3kb 



Fig. 6. Comparison of the electrophoretic mobilities of 
c-myc mRNArs from COLO 320 DM and HSR cells. The 
size of the normal c-myc mRNA is 2.3 kb. The rearranged 
c-myc locus in DM cells (see Fig. 5) seems to be pre- 
dominantly expressed giving rise to a shortened RNA. 



because the newly synthesised extra copies of 
DNA are not covalently linked to chromo- 
somal DNA of mitotic cells (65 ( 71 (. If, how- 
ever, there is a selection pressure to retain an 
increased gene dosage, progressive multipli- 
cation of gene copy number results. The in- 
cidence of cells bearing amplified genes under 
conditions of cytotoxic selection can vary by 
two orders of magnitude and is greatly in- 
creased by the presence of mitogenic sub- 
stances (hormones or tumour promoters) dur- 
ing selection (10, 84, 85) or certain carcino- 
genic or cytotoxic agents before selection [15, 
55, 79, 80, 81, 85). An interesting hypothesis 
suggested by Varshavsky (84, 85) supposes that 
the origins of DNA replication "fire" (initiate 
replication) illegitimately several times during 
a single cell cycle and that this kind of "repli- 
con misfiring" may be increased by substances 
such as tumour promoters and mitogenic hor- 
mones (10, 84, 85). Mariani and Schimke (55) 
point out that most of the cytotoxic agents that 
increase the incidence of gene amplification 
are inhibitors of DNA synthesis. Aberrant 
replication is known to take place after tran- 
sient inhibition of DNA synthesis and this re- 
sponse can lead to gene amplification (46, 47, 
55, 90). Mitogenic hormones probably increase 
disproportionate DNA replication, but they 



also enhance the colony forming efficiency 0 f 
drug-resistant cells in selective conditions ( 10}. 

According to the studies of Axel and his col. 
laborators (65), the multiple cycles of un- 
scheduled DNA replication at a single locus 
during a single cell cycle result in a structure 
schematically outlined in Fig. IF. The hydro- 
gen-bonded amplified copies of DNA depicted 
in Fig. IF must resolve into a tandem linear 
array before the next mitosis. This may well 
occur by homologous recombination between 
any one of several repeated sequences within 
the amplified domain (45, 65). Part of the re- 
combinations would lead to extrachromosomal 
circles possessing an origin for replication {16, 
62); these could be the precursors of dmin:s. 
The unequal recombinations mean that the 
resolved linear structure consists of tandemly 
repeated but heterogeneous units. According 
to Axel's model a gradient of amplification is 
formed so that centrally located sequences are 
amplified more than sequences distal to the 
origin of replication (65). This also has, in fact, 
been found to explain the large, complex DNA 
domain amplified in neuroblastoma cells in 
vivo (38, see also below). 

The chromosomal site of integration of 
transfected genes significantly affects the fre- 
quency and cytogenetic result of their experi- 
mentally induced amplification (83). The 
amplification frequency in some transfor- 
mants has been found to be 100-fold that of 
the others (83). This suggests that there also 
are preferred chromosomal positions for am- 
plification of host cellular genes and that chro- 
mosomal rearrangements may facilitate gene 
amplification by positioning chromosomal se- 
quences in a favorable array. In respect of the 
structural properties of the sequences involved 
in gene amplification, recombinatorially active 
regions have been implicated in experimental 
cases. DNA rearrangements involving restric- 
tion fragment length polymorphisms and 
variation in gene copy number have been de- 
tected in the human genome between clusters 
of short repetitive interspersed DNA sequences 
called Alu family DNA-sequences (17). Such 
inter-Alu sequences have also been detected 
in an extrachromosomal DNA form, including 
covalently closed circles" (17, 78). The copy 
number of inter-Alu sequences apparently va- 
ries in an age- and tissue-specific manner (17, 
78), but any comprehensive analysis of the 
phenomenon in human tumours is not yet 
available. It is also not yet clear whether these 
kinds of repetitive sequences are involved in 
generating amplified oncogene sequences in 
dmims or HSR:s in tumours. 
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Fia 8. A. Indirect immunofluorescence for p2l<«"> in Yl 
DM ceils. Similar fluorescence of the plasma membrane 
was obtained for the Yl HSR cells. Inset |b| shows control 
staining with normal rat serum and inset Id staining of 
normal rat kidney cells with the monoclonal antibody 
against p2l. 



NRS anti-p21 

Fig. 7, Elevated levels of the pll*™** protein in Yl cells 
(74) The Yl DM and HSR cells which harbor a 50-fold 
amplified c-Ki-ras oncogene [74] and control cells were 
labeled with pSJ-methionine and the p21< *;« protein 
was immunoprecipitated, as detailed (74). with normal 
rat serum iNRS) or rat monoclonal anti-p21 serum, lhe 
proteins were electrophoresed in a 15 % polyacrylamide 
gel in the presence of SDS. In addition to a major p21 
. band, a labeled band at about 16 kd was present in the 
immunoprecipitates. The amount of radioactivity in p21 
i. was about 50 fold that in normal rat kidney cells. The 
; Kristen sarcoma virus-transformed rat kidney cells (ob- 
j tained from the American Tissue Culture Collection) also 
•yielded unexpectedly low amounts of the v-Ki-ras 
p protein. 



CARCINOGEN-INDUCED GENE 
AMPLIFICATION AND CLONAL 
SELECTION OF CANCER CELLS 

[Although ceil sorting experiments have 
.shown a basal spontaneous rate of gene ampli- 
fication in eukaryotic cells (37), this can be 
increased severalfold by metabolic inhibitors 
°r cytotoxic agents (15, 37, 70, 81, 85). In 
I many respects the latter response is reminis- 
' c ent of the so-called SOS-response elicited in 
Wteriae by noxious stimuli (see 28). In a te- 
kological context, the rapid induction of gene 
amplification that apparently occurs frequent- 
ly through extrachromosomal intermediates 
toay provide cells with genetic material for 
Subsequent selective pressures operating in 
Wmful conditions (60). In cancer cells, the 
Mechanism may enhance the emergence of 



clonal populations of cells with increasingly 
malignant properties (58, 60). Such genetic 
instability of cancer ceils is clearly enhanced, 
leading to the rapid evolution of increasingly 
malignant tumour cell populations (19, 58). A 
serious question of practical importance is 
whether drug resistance in treated patients 
also selects cells that have an enhanced ability 
to amplify (onco)genes important for growth 
and progression of the tumour (84, 85). It is 
also possible that some of the carcinogenic in- 
sults caused by mutagens are only expressed 
as a result of subsequent amplification events 
induced by tumour promoters (84, 85) or facil- 
itated by hormones in, say replicating epithelial 
cells (10). The persistence of dmims in some 
tumours suggests that there is a selection 
pressure for their retention (8, 9, 11, m). 
Amplified DNA in dmin:s must contain an 
origin for DNA replication (62) and must be 
selected for in daughter cell populations, 
where it is unevenly segregated (71 1. In the 
' absence of such a selection pressure dmm:s 
are lost (71). In at least one study the length of 
an HSR has been found to increase during a 
selection of malignant cells for enhanced 
tumourigenicity (30). 

The amplified c-erbB gene in A431 cells codes 
for epidermal growth factor receptor (27). The 
abundant amounts of receptor protein on 
A431 cell surface may, however, provide the 
cells with an abnormal growth response (31). 
A naive supposition is that the amplified se- 
quences in dmin:s and possibly in HSR:s of 
tumours contain growth-promoting genes (see 
36 for references). This seems to fit well with 
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Fig. 9. Construction of a v~myc expression vector. A synthetic linker (ATGAGAATTCATCATJ containing a translations! B 

initiation codon was inserted downstream from the trp promoter in the pSRC ex 16 RI expression vector described M 

previously (see ref. 3J. Approximately one "half of the v-src sequences coding for the aminoterminal portion of ppSOT** .- m 

protein were then deleted and the remaining portion ligated in translational codon frame with the synthetic ATG. A j 

Hinc II fragment of v-wiyc from plasmid clone MC 38 (nucleotides 320—685 in the v«myc sequence in ref. 2) was ligated ! 

downstream form remaining v-src sequences in continuity with its reading frame. The resulting product contained 3 I |JJ 

amino acids from the synthetic linker, 252 amino acids encoded by the 756 base pair fragment from Sma I to Pvu II / ? * 
restriction sites in v-src DNA. 122 amino acids from the v*myc and 6 amino acids (corresponding to nucleotides 

2968-2085) from the pBR322 vector (3). 
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i recent findings on amplified oncogenes, though 
j in many cases the search for an amplified on- 
cogene is still continuing. Even positive find- 
j i n gs do not mandate a role for amplified cel- 
Jlular oncogenes, however, because the do- 
| m ain of amplified DNA is inevitably much 
I larger than a single genetic locus (e.g. 38). 



ENHANCED EXPRESSION OF AMPLIFIED 
ONCOGENES 

In all cases where they have been studied, the 
amplified oncogenes have been found abund- 
antly expressed at the RNA level, roughly in 
proportion to the amount of DNA amplifica- 
tion (see Table 1). Described cases of elevated 
RNA expression include examples of abnor- 
mal (5, 22) and ectopic (6) transcription. In at 
least four cases this enhancement is not li- 
mited to synthesis of RNA (31, 33, 41, 74, 82). 
The Yl cells that have amplified c-Ki-ras con- 
lain exceptionally large amounts of its protein 
product situated on the plasma membrane 
(ref.* 74, Fig. 7 and 8). High amounts of the 
c-myc encoded protein are also found in 
COLO 320 cells that have amplified the gene 
(33). The myc oncogenes have recently been 
shown to encode nuclear proteins (ref. 1, 3, 
26, 29, 32, 33, Fig. 9-11). Both the expression 
of the c-myc mRNA (39) and the subcel- 
lular localization of myc proteins are linked to 
the cell cycle (ref. 89, Fig. 12). It may be that 
elevated expression of specific c-myc func- 
tions is necessary for cell cycle progression 
and the growth transformation aspect of the 
phenotype of cancer cells that may contribute 
to tumour progression (7, 36). Elevated ex- 
pression of c-myc has been shown to replace 
in part platelet-derived growth factor in in- 
duction of competence for DNA replication 
l 7 )- Generally, enhanced expression of an on- 
cogene could be a necessary prerequisite for 
Requisition of a growth advantage by cells 
ha ving extra copies of the gene. This effect 



+ + + INDUCTION 



43.000+ 




Fig. 10. E. coli 294 was iransfected with the hybrid v-src 
v-?nyc plasmid outlined in Fig. 9 and ampicillin-resistant 
bacterial colonies were checked for the production of a 
43,000 m.w. bacterial v-myc protein after induction by 
growth to different optica] densities in minimal essential 
medium (M9, induction +) or complete medium (LB, in- 
duction—) (3). 



could also be the principal contribution of 
amplification to tumourigenesis. 



TUMOUR CELL AND STAGE SPECIFICITY 
OF ONCOGENE ACTIVATION AND 
AMPLIFICATION 

Tumour cell specificity of oncogene amplifi- 
cation has been found in three malignancies. 
The c-myc, L-myc or N-myc oncogene is ampli- 
fied in most cases of the variant form of small- 
cell lung cancer cells (53, 69), c-er&B is ampli- 
fied in several glioblastomas (Josef Schlessin- 
ger f personal communication) and the putative 
N-myc oncogene is amplified in about half of 
grade III and IV neuroblastomas (14, 72, 73, 
75). In addition to HSR:s, small-cell lung can- 
cers and neuroblastomas frequently show 
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DAPI 




Fig. X2. Fluorescent staining for DNA and myc protein in 
myelocytomatosis virus-transformed quail cells. In in- 
terphase cells, the myc protein is confined to the nucleae. 
In the mitotic cell, myc fluorescence is distributed 
throughout the cell unlike fluorescence for chromatin, 
which is compacted to chromosomes in the metaphase 
plate. In fact, there is less myc fluorescence associated 
with chromatin than with the rest of the cell. DAPI, dia- 
minophenyl indole DNA stain. The anti-myc protein rab- 
bit antiserum was used in a 1/200 dilution (ref. 89]. 



deletion of a portion of the short arm of chro- 
mosome 1 (13) and chromosome 3 (86, 87), 
respectively, in karyological examination. 
Two kinds of changes have also been described 
in different neuroblastoma oncogenes. The 
first is a mutation in the N-ras gene, an ac- 
tivated oncogene that was discovered because 
of its relation to other ras genes and trans- 
forming activity in transfectton .experiments 
(77). The second is amplification of a distant 
homologue of the c-myc gene called N-myc (72, 
73, 75). Although the transforming potential 
of the ti-myc gene has not yet been established, 
its consistent presence in a core segment of 
amplified neuroblastoma DNA [38, 57, 72, 73, 
75) and its elevated expression in most reti- 
noblastomas (48) suggests its oncogenic nature. 

Taya et al. (80) have recently described a 
human lung giant cell carcinoma grown in 
nude mice, where both c-Ki-ras and c-myc on- 



cogenes were amplified about 10-fold. Besides, 
sequencing studies indicated that at least 
some of the amplified c-Ki-ras copies were 
also mutationally activated in the 12th codon. I 6. 
These results fit to the multistage theory of 
cancer development and progression (see 58). I 
Apparently co-operating lesions in cellular on- f 
cogenes accumulate during tumour growth ■ 7. 
and selection and increase the malignant po- 
tential of the tumour cells (44). 

When does oncogene amplification come in- 
to play during tumourigenesis? Gene amplifi- 
cation may not be any initiating event in carci- t 9 - 
nogenesis. Amplification and enhanced ex- 
pression of c-myc and N-myc may occur during in. 
the progression of human carcinoma of the 
lung and neuroblastoma cells to a more malig- 
nant phenotype (14, 53, 73). There may be, 
however, no mandatory sequence of onco- I u. 
gene amplifications for the genesis of any par- 1 
ticular tumor. Amplification of an oncogene I 
could play its part in malignant progression of 
already initiated cells whenever it happened 12. 
to occur. 
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Abstract 

The human epidermal growth factor (EGF) receptor is known 
to be homologous to the v-*rb B oncogene protein of the avian 
erythroblastosis virus. Overexpression of the EGF receptor 
gene in A431 epidermoid carcinoma cells is due to gene 
amplification. In this study, a variety of squamous ceil carci- 
nomas were examined and one, SCC-15, contained high levels 
of the EGF receptor as determined by unmnnopredpitation via 
an EGF receptor-specific polyclonal antibody. Using a cloned 
EGF receptor complementary DNA as a probe, the level of 
EGF receptor RNA was found to be elevated four-fold in SCC- 
15 relative to normal cultured kerati oocytes. When the same 
probe was used to identify EGF receptor gene fragments on a 
genomic DNA blot, the SCC-15 cell line was shown to possess 
an EGF receptor gene copy number amplified four to five 
times. Gene amplification results in the enhancement in the 
level of the EGF receptor in several carcinomas and could be 
responsible for the appearance of the transformed phenotype 
in these cells. 

Introduction 

The epidermal growth factor (EGF) 1 stimulates growth and 
elicits a wide variety of rapid and delayed responses by binding 
to high-affinity cell-surface receptors which are 170-kD glyco- 
proteins (I). Recently, EGF receptor peptides have been se- 
quenced and found to be homologous to the avian erythro- 
blastosis virus erb B oncogene product (2), suggesting that the 
EGF receptor gene is the human oerb B oncogene. A431 
epidermoid carcinoma cells possess a very large number of 
EGF receptors (3), and the EGF receptor gene is amplified 
~30-fold (4-6). This amplification is responsible for the 
overcxpression of the EGF receptor protein in these cells 
(4-6). 

A cell culture system has been developed permitting serial 
cultivation of keratinocytes, whose growth is modulated by 
EGF (7). Such methods have been used to establish cell lines 
from squamous cell carcinomas of the oral epithelium (8). 
Because of the role of EGF in keratinocyte development, we 
quantified EGF receptor protein and RNA in several squamous 
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cell carcinomas. One cell line, SCC-15, was found to contain 
high amounts of receptor protein and RNA, and a four- to 
fivefold amplification of the gene. 

Methods 

The squamous cell carcinomas established by Rheinwald and Beckett 
(8) were obtained from, and maintained according to the American 
Type Culture Collection (Rockville, MD). 1623 was originally designated 
as SCC-15; 1628 as SCC-25; and 1629 as SCC-9 (8). Normal human 
esophageal epithelial cells were grown as reported (9). Maintenance of 
other cell lines was as described elsewhere (10). Proteins were labeled 
with l"S]methionine and immunoprecipitated as previously described 
(10). Po!yA+ RNA was isolated by guanidine isothiocyanate solubili- 
zation and CsQ centrifugation (II), and oligo(dT)-affinity chromatog- 
raphy. RNA (Northern) Wotting was performed as described (II, 12). 
High molecular weight DNA was isolated (4) and analyzed by DNA 
(Southern) blotting (4, 10, 13). The EGF receptor complementary 
DNA (cDNA) clone pE7 was constructed and isolated from an A43I 
cDNA library (II), DNA fiagraents were "P-labeled by nick translation. 

Results 

A large number of cell lines were initially screened for EGF 
receptor levels by determining their ability to be killed by an 
EGF-pseudomonas exotoxin conjugate, a technique described 
previously (10). Several squamous cell carcinomas were found 
to be relatively sensitive to the EGF-toxin conjugate, including 
SCC-25, SCC-9, and particularly SCC-15, all derived from the 
human tongue (8). These three cell lines were labeled with 
[^methionine, and their extracts immunoprecipitated with 
a goat polyclonal antibody to the EGF receptor, affinity- 
purified as described (10). When compared with A431 cells, 
which make very large amounts of the EGF receptor, SCC-25 
and SCC-9 make moderate amounts and SCC-15 high amounts 
of the receptor (Fig. 1, lane a vs. e. g, and c). Quantitation of 
the immunoprecipitation data revealed that SCC-15, SCC-25, 
and SCC-9 make 41, 15, and 4% of the amount of EGF 
receptor made by A431 cells, respectively. 

Because SCC-15 cells had high levels of receptor, polyA+ 
RNA was isolated from these cells, electrophoretically fraction- 
ated on agarose, and analyzed by RNA (Northern) blotting. A 
cloned A431 cDNA (pE7) encoding the EGF receptor (11) 
was 3I P-labeled and used as a hybridization probe to visualize 
EGF receptor RNAs. Fig. 2 A shows that SCC-15 contains 
both the 10- and 5.6-kilobase species of EGF receptor RNA. 
The levels are approximately four- to fivefold higher than 
those found in either KB or A498 kidney carcinoma cells; 
these cell lines were previously found to possess readily de- 
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testable levels of both receptor RNAs (10). Fig. 2 B shows that 
cultured human epithelial cells contain EGF receptor-specific 
RNA (HEIA, lane e) whose levels are higher than an early 
passage human fibroblast D551 (lane/), equivalent to A498 
(lane g)> but much lower than SCO 15 (lane b). 

To determine if an elevated gene copy number was asso- 
ciated with enhanced expression of the EGF receptor gene in 
SCC-15 cells, genomic DNA was isolated from normal cultured 
epithelial cells (HEIA) and SCC-15 cells, digested with Hindlll, 
electrophoretically fractionated, and subjected to DNA blotting 
analysis. An EGF receptor cDNA (pE7) was used as a hybrid- 
ization probe to identify receptor DNA fragments. Fig. 3 A 
reveals that the SCC-15 genome contains four- to fivefold 
amplified EGF receptor gene sequences relative to normal 
epithelial cells (lane a vs. b). Analysis of 0-actin gene fragments 
on the same filter by hybridization to a chick actin cDNA 
probe indicated that equal amounts of DNA were loaded per 
well (data not shown). Digested SCC-15 DNA had to be 
diluted about fourfold (Fig. 3 B, lane e) to approximate the 
signal intensity of receptor DNA fragments from SCC-25, 
SCC-9. and KB cells (lanes The KB cell EGF receptor 
gene is known not to be amplified (10). 

Discussion 

It may be significant that A431 carcinoma cells are not unique 
in their possession of amplified EGF receptor genes. We report 
here that the EGF receptor gene in squamous cell carcinoma 
SCC-15 is amplified four- to fivefold relative to normal epithelial 
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Figure 2. RNA blot analy- 
sis of polyA+ RNAs using 
the "P-labded EGF recep- 
tor cDNA probe pE7. (A) 
and (£) arc autoradiographs 
from two separate gels. 
Sizes are in kilobases (left). 
5 (lanes a, b t d. and g) or 
10 (lanes c, e, and/) pg of 
RNA were loaded. 
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Figure S. DNA blot analysis of tf/ffd Ill-digested genomic DNAs 
using the pE7 probe (see Fig. 2). (A) and {B) are autoradiographs 
from two separate gels. Sizes are in kilobase pairs (right). 10 /ig of 
DNA was loaded except in (B)> lanes d-f, which represent serial 
dilutions of the 1623 DNA shown in lane c. 



cells. The amplification of the EGF receptor gene may cause 
the initiation or maintenance of the malignant state in some 
human cells. 

Previously, we reported that a variety of transformed cell 
lines synthesize relatively high amounts of both EGF receptor 
protein and messenger RNA (10). It is conceivable that a 
moderate or even a small increase in the level of the EGF 
receptor leads to a change in the cellular phenotype, as has 
been demonstrated for the sre gene product (14). If this 
hypothesis is correct, then even a minor amplification of the 
EGF receptor gene copy number could contribute to the onset 
of tumorigenesis. Hendler and Ozanne (15) have examined 
lung squamous cell carcinomas and found that they contain a 
2.5-5-fold increase in EGF receptor levels. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0mtoft*§, Thomas ThykjaerU, Frederic M. Waldman||, Hans Wolf**, 
and Julio E, Cells** 



Gain and loss of chromosomal material is characteristic 
of bladder cancer, as weO as mafignant transf ormafion In 
general. The consequences of these changes at both the 
transcription and translation levels Is at present unknown 
partly because of technical firnftations. Here we have at- 
tempted to address this question In pairs of noninvasive 
and Invasive human Madder tumors using a combination 
of technology that Included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels {5600 genesk and high resolution 
two-dimensional gel electrophoresis. The results showed 
that there Is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms, This ef- . 
feet depended (p < 0.015* on the magnitude of the com- 
parative genomic hybridization change, in general (18 of 
23 cases), chromosomal areas with more than 2-fold jaln 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels, Be- 
cause most proteins resolved fay two-dimensional gets 
are unknown it was only possible to compare mRNA and 
protein alterations In relatively few oases of well focused 
abundant proteins!, 3£Hh few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The Implications, as weB as limitations, 
of the. approach are discussed. Molecular & Cellular 
Proteomics 1:$7-45, 2002. 

■4 ■ . - . 

. Aneuploldy Is a common feature of most human cancers 
(1), but liWe is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation Java Is. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there Is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of Individual genes in solid tumors 
have revealed a goetf correlation between gene dose and 
mRNA or protein levels in the case of c-erb-82, cyciln d1, ' 
emsf , and N-myc (3-5). However, a high cyciln 01 protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number in- 
crease was observed without concomitant c-myo protein 
overexpresslon (&U 

In human bladder tumors, karyotyping, fluorescent In situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that ©earn to be 
characteristic of certain stages of disease progression. In the 
case of non-Invasive pTa transitional cell camlnornas (TCCs), 
this Includes loss of chromosome 9 or parts of rt, as weO as 
loss of Y In mates. In mlnlmaJly Invasive pTt TCCs, the fol- 
lowing alterations have been reported: 2q~, 11p~, 1q+, 
11q13+, 17q+, ano* 20q+ (7-12). It has been suggested that 
these regions .harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas Involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and chains jyery difficult 

In this investigation wehave combined gehome-wide tech- 
nology for detecting genomic gains and losses (CGH) with 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels In pairs of non-invasive and In- 
vasive human bladder TCCs, 

EXPERIMENTAL PROCEDURES 

Material- Bladder tumor biopsies, were sampled after Informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By ll&ht microscopy tumors 335 and 532 were 
staged by an experienced pathologist ae pTa (superficial papillary), 

1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCQ transitional cell carcinoma: LOK, toss of heterozygosity; 
PA-FABP, psoriasis-associated fatty acid-binding' protafn; 20, 
two-dimensional. 
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grade I and U, respectively, tumors 733 and 827 were staged as pTt 
(Invasive Into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation -Tissue biopsies, obtained fresh from surgery 
were embedded Immediately In a sodium-guanldinlum thlocyanate 

RNAzol IB RNA isolation method (WAK^Chemie Medical GMBH). 

crtNA Preparaffon-1 M g 0 f mRNA was used as starting material. 
^l 3000 ^ 6trand ^thesis was performed using the 
&uperScrfpt<g> choice system (Invltrogen) according to the manufac- 
turer's Instructions but using an oligofdT) primer containing a T7 RNA 
polymetase binding site. Labeled cRNA was prepared using the ME- 
GAscrlp® m vitro transcription kH (Ambion). Biotln-!abeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Array Hybridization and Scanning i- Array hybridization and scan- 
nlng was modified from a previous method (13). 10 ^tg of cRNA was 
fragmented at 94 °C for 35 min in buffer containing 40 mM Tris 
acetate, pH 8.1 , 1 00 mM KOAc. 30 mM MgOAc. Prior to hybridization 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCI* 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 *C for 5 min' 
subsequently cooled to 40 *C. and loaded onto the Affymetrlx probe 
array cartridge. The probe array was then incubated for 16 hat 40 *C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes In 6x SSPE-T at 25 *C followed by 4 washes In 0.5X SSPE-T 
at 50 *C. The biotlnylated cRNA waa stained with a streptaMtfin- 
phycoerythrin conjugate, 10 *ig/ml (Molecular Probes) in 6x SSPE-T 
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for 30 min at 25 *C followed; by 1 0 washes in 6x SSPE-T at 25 *C. The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope (made for Asymetrix by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Affymetrix gene 
expression analysis software. 

MicwsateJfite Analysis - MicrosatelMe Analysis was performed as 
described previously (14). Microsatetlttes were selected by use of 
www^cbi-nlm.nih.gov/geoemap98, and primer sequences were ob- 
tained from the genome data base at www.gdb.org. DNA was extracted 
from tumor and Wood and amplified by PCR In a volume of 20 ^ for 35 
cycles. The ampflcons were denatured and etectrophoresed for 3 h In an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor amplteons compared with Wood. 

Proteomic Analysis-TCCs were minced Into small pieces and 
homogenized in a small glass homogenizer In 0.5 ml of lysis solution 
Samples were stored at -20 °C until use. The procedure for 2D gel 
electrophoresis has been described In detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or CcornassJa Brilliant Blue, Pro- 
teins were Identified by a combination of procedures that included 
microsequencing, mass spectrometry, two-dimensional gel Western 
immunoblotting, and comparison with the master rv^lmensfonal gel 
Image of human keratlnocyte proteins; see biobase.dk/cgi-btn/cefis. 

CGH- Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Ruorescelrvlabeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA <200 ng), and human Cot-1 DNA (20 m9) were 
denatured at 37 °C for 5 mln and applied to denatured normal met- 
aphase slides. Hybridization was at 37 "C for 2 days. After washing, 
the slides were counterstained with 0.15 jig/ml 4 r 6-diamldino-2-phe> 
nyfindole «n an anti-fade solution. A second hybridization was per- 
formed for ail tumor samples using fluorescelrHabeled reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberratlpns detected during the initial hybridization. Each CGH ex- 
periment also Included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital knags analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating; regions of DNA gains and losses. The average 
greencred fluorescence Intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the green:red fluorescence Intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4.6^diamidino-2-phenyUndole band- 
ing patterns. Only images showing uniform high Intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization-lbe CGH analysis 
identified a number, of chromosomal gains and losses in the 
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Tabu I 

Correlation between attentions detected by CGH and by expression monitoring 

Top CGH used as Independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable fit expression alteration - what CGH d eviation was found). 

Tumor 733 Vs. 335 



CGH alterations 



Expression change dusters 



Concordance CGH alterations 



13 Gain 



10 Loss 



10 Up-regulatkm 

0 Down-regulation 

3 No change 

1 Up-regulation 

5 Down-regulation 

4 No change 



Tumor 827 vs. 532 
Expression change dusters 



Concordance 



77% 



50% 



10 Gain a Up-regulatlon 

0 Down-regulation 
2 No change 

12 Loss 3 Up-regulation 

2 Down regulation 
7 No change 



Expression change clusters 



Tumor 733 vs. 335 



CGH alterations Concordance Expression change dusters 



Tumor 827 vs. 532 
CGH alterations 



17% 



Concordance 



16 Up-regulation 
21 Down-regulation 
15 No change 



11 Gain 

2 Loss 

3 No change 
1 Gain 

8 Loss 

12 No change 
3 Gain 

3 Loss 

9 No change 



69% 
38% 
60% 



17 Up-fegulation 
9 Down^regulation 
21 No change 



10 Gain 

5 Loss 

2 No change 

0 Gain 

3 Loss 

6 No change 

1 Gain 
3 Loss 

17 No change 



59% 
33% 
81% 



two Invasive tumors {stage pT1, TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p- t 9q22-q33-, and and 7+, 9q-, 
and respectively. Both invasive tumors showed changes 
<1q22-24+, 2q14.1-qter-, 3q12-q13.3- 6q12-q22-~, 
9q34+, 11q12-q13+, 17+, and 20q11.2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown In Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent 
and the average numerical deviation from normal was 0,4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827- The largest 
changes, amounting to at least a doubling of chromosomal 
content were observed at 1q23 in TCC 733 (Fig. \A) and 
20q12 in TCC 827 (Rg. 18). 

mRNA Expression in Relation to DNA Copy Number- The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level In 
the Invasive versus the non-Invasive counterpart, Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Rg. 1), The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution- 
Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations In these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p arid 9q, showed a relative gain of more 
than 100% In DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts In both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (Table I, fop). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the Independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that Increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no atter- 
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atlon in expression. No alteration was detected by CGH In 
most of these areas (TCC 733, 60% and TCC 827, 81 %; see 
Table I, bottom). Because the ability to observe reduced or 
Increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations In the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Rg. 2)CEor both tumors TCC 733 (p < 0.015) and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the ONA copy number) and alterations detected by the array 
based technology Fig. 2% Similar data were obtained when 
areas with altered expression were used as Independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction In expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Miavsatelftte-based Detection of Minor Areas of Loss- 
es— In TCC 733, several chromosomal areas exhibiting ONA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1, TCC 733 
chromosome 1q32, 2p21, and 7q21 and q32, 9q34 r and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two micro-satellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Rg. 3). Additionally, chromosome 2p In 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing Increased expression 
there was a correlation with the DNA copy number alterations 
(Rg. 1 A). As Indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
Implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio In the CGH analysis; 
however, clusters of five and three genes, respecOvely, lost 
their expression. Two microsatellites (D11S1760, 011S922) 
positioned close to MUC2. IGF2, and cathepsin D Indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24 
and In TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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F10. 3. Microsatellite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 dose to Hu class I histocompatibility antigen (gene 
number 38 in Rg. 1), (b) by D1SZ735 close to caihepsln E (gene 
number 41 in Rg. 1), and (c) at chromosome 2p23 by 02S2251 close 
to general ^spectrin (gene number 1 1 on Fig. 1) and of (o) tumor 827 
showing loss of heterozygosity at chromosome 18q12 by S18S1118 
dose to mitochondrial 3-oxoacyf-coenzyme A thlolase (gene number 
12 in Fig. 1). The upper curves show the electropherogram obtained 
from normal DMA from leukocytes (A/), and the tower curves show the 
electropherogram from tumor DNA (7). In all cases one allele Is 
partially lost in the tumor ampllcon. 

showing reduced mRNA transcripts. Only the mlcrosatellite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes In the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels- 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating In areas away from the edges of the pH 
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Fkl 4. Correlation between protein levels as Judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided In 
three groups, unaltered In level or up- or down-regulated ifiorizontaJ 
axfe). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene (vertical axis). A, mRNAs mat were scored as 
present In both tumors used for the ratio calculation; A. mRNAs that 
were scored as absent in the Invasive tumors (along horizontal axfe) or 
as absent in non-invasive reference (fop of figure). TWo different 
scallngs were used to exclude scaling as a confounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (#Q were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.0Q5) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
/eff), phosphoglucomutase 1, glutathione transferase class $1 number 
4, fatty acid-binding protein homologue, cytokeratln 15, and cyto- 
keratln 13; 8 (from tefl), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeratln 1.3, and calcyctin; C(from tefl), cr-eno- 
lase, hnRNP B1„ 28-kDa heat shock protein, 14-3-3-^ and 
pre^nRNA splicing factor, 0, mesothelial keratin K7 (type II); £ (from 
(op), glutathione $-tran$ferase-ir and mesothelial keratin K7 (type. II); 
F(from top and left), adenyryl cyclase-associated protein, E-cadheiin[ 
keraUn 19 r calglzzarin. phosphoglycerate mutase, annexin IV, cy- 
toskeletal y-actla hnRNP A1« Integral membrane protein calnexln 
(IP90), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translations controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na,K- 
ATPase 0-1 subunit; G, (from to/> and feffy TCP20. calgizzarln, 70- 
kOa heat shock protein, calnexln, hnRNP H, cytokeratln 15, ATP 
synthase, keratin 1 d, triosephosphate Isomerase, hnRNP F, liver glyc- 
eraldehyde-3-phosphatase dehydrogenase, glutathione ^-transfer- 
ase-w, and keratin 8; H (from feft), plasma gelsblin, autoantlgen cal- 
retfculin, mioredoxin, and NAD+-dependent IShydroxyprostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxyiase 0-subunlt, cyto- 
keratln 20, cytokeratln 17, prohibition, and fructose 1,6-blphos- 
phatase; J annexin It; K, annexin IV; L (from fop and /eft), 90-kDa heal 
shock protein, prolyl 4-bydroxylase 0-subunit, a-enolase, GRP 78, 
cyclophiltn, and cofllln. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures''). In general there was a highly significant corre- 
lation (p < 0,005) between mRNA and protein alterations (Rg, 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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Fig. 5. Comparison of protein and transcript levels in invasive 
and non-Envaslve TCCs, The upper part of the figure shows a 2D gel 
(te/r) and the oligonucleotide array iflght) of TCC 532. The red rectah- 
gfes on the upper get highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below. 
Clearly, cytokeratlns 13 and 15 are strongly down-regulated in TCC 
827 (red annotation). The tile on the array containing probes for 
cytokeratin 1 5 is enlarged below the array (red arrow) from TCC 532 
and is compared with TCC 827. The upper row of squares In each tile 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
specific binding). Absence of signal is depicted as WacA, and the 
higher the signal the fighter the color. A high transcript level was 
detected in TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13, a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure (jfefl) show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (Invasive), respectively. Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript A medium transcript level was de- 
tected In the case of TCC 335 (1 277 units) whereas very low levels 
were detected in TCC 733 (166 units). IEF, isoelectric focusing. 



keratins encoded by genes on chromosome 17 (Fig- 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected In TCCs 733 and 335, and of these 1 9 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed in the non-invasive TCC 335 but lost in the Invasive 
counterpart (TCC 733; see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes In transcript levels also 
showed corresponding changes In the protein level (Table II). 
These regions Included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratlns 17 and 20, annexlns II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes In chromosome 17q t a fre- 
quently amplified chromosomal area In invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression in two pairs of non-lnvasjve 
and invasive TCCs using high throughput expression arrays 
and proteomics. in combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fotd change, 
thus being at the border of detection. In several cases, how- 



Protein 



Table II 

Proteins whose expression /eve/ correlates with both mRNA and gene dose changes 



Chromosomal location 



Annexln II 
Annexln IV 
Cytokeratin 17 
Cytokeratin 20 
(PA-)FABP 
FBP1 

Plasma gelsolin 
Heat shock protein 28 
Prohifaltln 
ProlyM-hydroxyl 
hnRNPBI 



1q21 
2p13 

17q12-q21 

17q2l,1 

8q21.2 

9o22 

9q31 

15q12-q13 
17q21 
17q25 
7p15 



• Abs, absent; Pres. present. 

* In cases where Ihe corresponding alterations were 



Tumor TCC 


CGH alteration 


Transcript alteration* 


Protein alienation 


733 


Gain 


Abs to Pres a 


Increase 


733 


Gain 


3.9-Fold up 


Increase 


827 


Gain . 


3.8-Fold up 


Increase 


827 


Gain 


5.6-Fold up 


Increase 


827 


Loss 


10-Fold down 


Decrease 


827 


Gain 


2.3-Fold up 


Increase 


827 


Gain 


Abs to Pres 


Increase 


827 


Loss 


25-Fold up 


Decrease 


827/733 


Gain 


3.7-/2.5-FoW up* 


Increase 


827/733 


Gain 


5.7-/1 .6-Fold up 


Increase 


827 


Loss 


2.5-Fold down 


. Decrease 



found In both TCCs 827 and 733 these are shown as 827/733. 
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ever, an increase or decrease in DNA copy number was 
associated with cfe novo occurrence or complete loss of tran- 
script, respectively. Some of these transcripts could not be 
detected In the non-Invasive tumor but were present at rela- 
tively high levels In areas with DNA amplifications in the Inva- 
sive tumors (e.g. In TCC 733 transcript from cellular Ilgand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary Units; In TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area Indicates an increased 
likelihood of gain of chromosomal material In this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie In the loss of controlled 
methyiation In tumor ceils (17-19). Thus, it may be possible . 
that in chromosomes with increased DNA copy numbers two 
or more alleles could be demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression In yeast, but In this case all 
the genes were present In the same ratio (22), a situation that is 
not analogous to that of cancer ceils, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p- t 9q~, 1 q +, Y- 
(2, 6). and ln,pT1 tumors, 2q-,t1p-, 11q-, 1q+, 5p+, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33~ and 
9q- and Y~, respectively- Likewise, the two minimal invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p~, often 
linked to 20q + and 11 q1 3+ (both tumors) (7-9). These ob- 
servations Indicate that the pairs of tumors used In this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it Is only possible to get a crude picture of 
chromosomal Instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with Increased copy numbers. Analysis of these regions 
by positioning heterozygous mfcrosateHites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity In several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA mlcroarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed. (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploldy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an Impact on the expression level in normal cells and Is often 
reduced In tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very dose, and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively hornogenous, a notion recently 
supported by CGH and LDH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA. and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to translational 
regulation, post-translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very Important . In the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as deter- 
mined by arrays and 20-PAGE (25), and a moderate correla- 
tion was recently repotted by Ideker etai (26) In yeast 

interestingly, our study revealed a much better correlation 
between gained chromosomal areas and Increased mRNA 
levels than between loss of chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript. One possible 
explanation could be that by losing one allele the change In 
mRNA level is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold Increase in gene copy number resulting In a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may In the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area In chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated Is the large 
extent of protein modification that occurs after translation, 
requiring imrnunoidentfficatJon and/or mass spectrometry to 
correctly Identify the proteins in the gels. 

In conclusion, the results presented In this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on mlcroarrays with many thousand radiation 
hybrid-mapped genes will Increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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ABSTRACt 

Genetic changes underlie tumor progression and may lead to cancer- 
'specific expression of critical genet, Over 110Q publications have de-.. 
; scribed the we of eomp exstive ^emotalc hybridization (CGH) to analyze 

thtpattera of copy number alterattent In cancer, but very fewtf thagenes 

affected are known. Here, we performed hig^raoJotiori CGH analysis oa 

eDNA microarrays In breast cancer and directly compared copy number * 

sutd mKNK expreaskm levels of tt^W .genes 40 qnantitate the Impact of 

genomic change* on gene exprestioiL We Identified and mapped the 

boundaries of 24 independent ampticoni, ranging in size from 02 to U 

Mb, Throughout the genome, both high- and tow-level copy number 

changes bad a substantial Impact <m gene expression. With '44% of the- 

highly amplified genes showing overexpresrion and J 03% of the highly 

ovcrexprcoed genes being amplified. Statistical analysis- with random 

permutation tests identified 270 gents whose expression ieveis across' 14 

samples were lyatematicnUy attributable to gene, amplification.- tyese 

inctoded miwt previously described ampliqed genca in breast cancer and * 

many novel targeti for genomic alterations, including the BOXB7 gene, 
. the presence of which In a novel ampllcon at 17o2L3 was validated m 
of primary breast cancers and associated vrftK poor patient prog- 

nods. In conclusion, CGH on cDNA microarrays revealed hundreds or. 

novel genes whose ovempresslan Is attributable to gene amplification, 

Thae genes may provide insights to the dona], evolution and progression 

of breast cancer and hlghUgbt promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries some of which may explain the clinical behavior of the tumors 
(1-6). Despite (his progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained^eiusive, and the utility of gene expression profiling in the 
itotification of specific, therapeutic targets remains Jimitel^ ** * 

• : >Uxmnnlation of genetic 'defects is thought to underlie the clonal - ^ «» ^m^^m ^ >xim cgiobsi i*» 1% of 

evolution of cancer. Identification of the genes that mediate the effects cDNA ntios > <0.4S26 (global lower 7% offce expfteaba ratios). B, percentage 
of genetic changes may be. important by highlighting transcripts that ^5? n ^ ^ ^Z?^^**"* 0 * "** * 
are actively involved in tumor progression. Such transcripts and their 

encoded proteins would be ideal targets , for anticancer therapies, as . < 

demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as &RBB2 and EGFX (J ,%)/m breast cancer 20. recurrent regions of DNA amplification have been mapped in 
other solid tumors. Besides ampUfications of known oncogenes oyer b W c ? ncer by CGH ^' (9, 10 )- Howevcr . amplicons are often 

; ' * . poorly defined, and their impact on gene eanjjrcssion remains 

~~ unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that , are attributable to underlying gene copy 
number alterations would highlight transcripts that are. actively in- 
. — — — vv. «*v - ^mmm rwawmon, ue volvcd in the causation or maintenance of, the malignant pbenotype, 

™en^ 

cojmeicirf-and Technical Sciences, and u«SwedUh R««rcb cS^^ ,°P H microarrays. to: (a) determine the global impact that gene copy 
can ^ ,C C donntt? <? ' M * m mUab,c ** iCwcw Rcwsreh Online (http^/ number variation plays in breast cancer deveiepmebtand progression; 
^rj«mW c4uki% to* wo*. and (ft) identify and characterize those genes, whose mRNA expres- 

^To wfcera re^ooite foe reprint* should bo addressed, at Ubcmoiy of Cancer Otxet- ■ 

fc^Kt^ 5, n» ibbn^iuom used ate: CGH, eompe«tive s^omfc bybridisstioa; PI5H. flue 

moac; ^J«7-^I25, Fax. 358-3247-11 68; B^nsih aoi^kttlbnifiTm^tsii. reicctKe to situ bybridiialk«t: RT-PCR. m^enle triuJcri|^PCR. 
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sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND METHODS 

Breast Cancer Cell Unes. Fourteen breast cancer cell lines (BT-20 BT- 
474, HCCI428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC8I2, ZR-75-1. and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions- Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNa Microarrays. The 
prepararion and printing of dte 13,824 cDNA clones on glass slides were 
pwfoirned as described (1 1-13). Of these clones, 244 represented uncfaarac- 
Prized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
15). Briefly, 20 M g.of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h withal and*** (Life Technol- 
ogies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six 
ug of digested cell line DMAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using 
the Biopnme Labeling kit (Life Technologies, Inc.). Hybridization (14, 15) and 
posthybndization washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagenc, 
U Jolla, CA) was used in all experiments. Forty M g of reference RNA were 
labeled with Cy3-dUTP and 3.5 M g of test mRNA wilfa Cy5^UTP, and the 
labeled cDNAs wero hybridized on microarrays as described (13, 15). For both 
microarray analyses, a laser coa focal scanner (Agilent Technologies, Palo 
Alto, CA) was used, to measure the fluorescence intensities at the target 
locations using the DEARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
toe copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of ail targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spoued four times onto the 
array. Low quality measurements (I.e.. copy number data with mean reference 
intensity <100 fluorescent units, and expression data with both test and 
reference tntensity <100 fluorescent units and/or with spot size <50 units) 



were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define outpoints for iacrcascoY 
decreased copy number. Genes with CGH ratio > J.43 (representing the upper 
5% of tho CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5<K) were considered to be 
deleted. 

Statistical Analysb of CGH and cDNA Microarray DaU. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, > 1 .43)* and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 
statistics (I). We calculated a weight, w r for each gene as follows: 



w m 



where m,„ <r gX and o^, denote the means aod SDs for the expression 
levels for amplified and nonamphfied cell lines, respectively. To assess the 
statistical significance of each weight, we pertbrmed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by a. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and Amplicon Mapping. Each 
cDNA clone on the microarray was assigned to a Unigene cluster using the 
Unigene Build 141* A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of Cali foruia Santa Cruz's GoldcnPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons wero defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



* Internet address: httptfrdseairluAga^ 
Internet address: www.gcnomcucsc.edu. 
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Table I Summary tffadepmdatt ampHcotx* fit U brtnjt cancer cert llna by 
CGH microarray 



Location 


Start (Mb) 


End (Mb) 




iplJ 


132.79 


132.94 


oa 


Jq2l 


173.92 


177,25 


3J 


. Iq22 


I79.2S 


179^7 


0J 


JpM 


71,94 


74.66 


2.7 


7pl2.1-7pll.2 


55.62 


60.95 


53 


7qJl 


125.73 


(30.96 


5.2 


7q32 


140.01 


140.68 


0.7 


8q2l.ll-8q2l.l3 


86.45 


92.46 


6.0 


8q2U 


98.45 


103.05 


4 6 


$q233-«q24.H 


129.88 


142.15 


1? 3 


8q24J2 


151.21 


152,16 


1.0 


9pl3 


38.65 


39.25 


d.6 


I3q22-q3| 


77.15 


8138 


4.2 


16q22 


86.70 


87.62 


0.9 


17qtl 


29.30 


30.85 


1.6 


l7c.l2-q2L2 


39.79 


42.80 


3.0 


l?q2l32H}2l.33 


52.47 


55.80 


3.3 


17q22-i l 23J 


63.81 


69.70 


5.9 


(7q233-^24.3 


69.93 


74.99 


5.1 


19qI3 


40.63 


41.40 


0.8 


20q 11.22 


3449 


35,85 


1.3 


20q.l3.IZ 


44.00 


45.62 


1.6 


20ql3.l2^I3.l3 


46.45 


49.43 


3.0 


20^13^13.32 


5IJ2 


59.12 


7.8 



extended to include neighboring nonamplified ctoocs (ratio, <l.5> The am- 
pticon size termination was partially dependent on local clone density. 

FfSH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RPU-361K8 was la- 
beled with SpcctnimOrange (Vysis, Downers Grove, IL), and Spectrom- 
Orange-labelcd probe for EGFR was obtained from Vysis. SpectrumGreen- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(18). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the N1H. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed, using 
Access RT-PCR System (Proracga Corp., Madison, WI) with 1 0 ng of mRNA 
as a template. HOXB7 primers were 5'-GAGCAGAGGGACTCGGACTT-3' 
and S'^GTOVGGTAGCGATTGTAG-S'. 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH raicroarrays) in 14 breast cancer cell 
lines- the results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexprcssed {i.e.. belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. I A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. IB). LowMevel copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer AmpUcons. Base-pair 
locations obtained for 1 1,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to i2 Mb of DNA (Table 
1), Several amplification sites detected previously by chromosomal 
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CGH were validated, with lq21, 17ql2-q2L.2, 17q22-q23, 2<kfci3.i, 
and 20ql3.2 regions being most wwnmonly amplified Furthermore! 
the boundaries of these amplicons were precisely delineated. In ad- 
dition, novel araplicons were identified at 9pl3 (38.65-39.25 j^m 
and 17q21. 3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target G^nes, 
The cDNA/CGH microarray technique enables the direct cancela- 
tion of copy number and expression data on a genc-by-gene basts 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lpl3, 17q22-q23, and 20ql3 were highly ovesrex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pl J-pl2 (Fig. 3A). In BT-474, the two known ampl icons 
at I7ql2 and I7q22-q23 contained numerous highly overex- 
pressed genes (Fig. 3/7). In addition, several genes, including the 
homeobox genes HOXB2 aod rYO*B7 f were highly amplified in a 
previously undescribed independent amplicon at I7q21.3. HOJCB7 
was systematically amplified (as validated by FISH, Fig. 35, insei) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC812, and ZR-75-30 ceils. Furthermore, this novel 




Fig. 3. Annotation of gene expression data on CGH microarray profile*. A, genet in the 
7 P H-pl2 ttuplicon in the MDA-468 cell Hoc ire highly expressed (red Jot*) and include 
the EGFR oncogene. B. several genes in the I7ql2. 17q2l.3, and I7cj23 amplicons in the 
BT-474 breast cancer cell line arc highly ovemxprcssed (rrd) and include the HOXB7 
gene. The data labels and color coding arc as indicated for Fig. 2C Insets show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
Increased copy number by interphase FISH using EGFR (red) and chromosome 7 
centromere probe (green) to MDA-46& {A) and MXB7*poaGc probe (red) and chro- 
mosome 17 centromere (green) to BT-474 cells (B). 
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Fig. 4. List of 50 genes with a statistically 
significant correlation (a value <0.O5) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value for each 
gem are indicated The genes have been ordered 
according to their position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 cell lines. The 
key to the color code is shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genes is shown in supplemental Fig. B. 
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amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FiSH to a Ussue microarray and was associated 
with poor prognosis of the patients (P = OiOOI). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Ampllcons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data,* 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



* Internet address: http7Avww.geneofttotogy^rg/. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in >1000 publications applying CGH* (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 



* Internet address: httpy/www.nchi.nlmjilh.gov/eMre2. 
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level copy number increase. Low-level copy number gains and losses between HOXB7 amplification and poor patient prognosis. Ov^^ii 

a|» had awgn^can^fraence on expression levels of genes in the our results illustrate how the identification of genes activate*^ 

regions affected, but these effects were more subtle on a gene-by-gene gene amplification provides a powerful approach to hteK^JZ 

baas than those of high-level amplifications. However, the impact of genes with an important role in cancer as well as to prioritize* JLd 

low-ievel gams on the deregulation of gene expression patterns in validate putative targets for therapy development 



cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH microarray analysis identified 24 independent breast 
cancer ampHcons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel ampHcons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger ampHcons. One of these novel amplicons 
involved the bomeobox gene region at 17q2IJ and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcripUon factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transection induced ceil pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing ~2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, AfYC, 
EGFR % ribosomal protein s6 kinase, and A/B3, but also numerous 
novel genes such as NRAS-retated gene (lp!3), syndecan-2 (8o22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (6) a high-resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel ampHcon at 
17q2i.3 implicated amplification and overexpression of the 
HOXB7 gene, in breast cancer, including a clinical association 
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Microarray analysis reveals a major direct role of 
DNA copy number alteration in the transcriptional 
program of human breast tumors 
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Genomic DNA copy number alterations are key genetic events in 
*Mevelopment and progression ol human cLcers- Here w" 
E r^! mtooarray comparative genomic hybrid- 
a sflsSj ^ GH> K ana Ws of ONA copy number variation in 
of Prin»ry human breast tumors. We have profiled DNA 
copy number alteration across 6.691 mapped human genes, in 44 
predominant* advanced, prfmary breast tumors and 10 breast 

£T? 2? "V* 0,6 OVera " P**"™ of DMA ampiincXn 
SL1% ? C ° ,r ° borate P revlous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of amplicon boundaries and 
quantitative analysis of amplicon shape provide significant 
bnprovemem in the localization of candidate on^Tnes PaS 

^! r ? V ^ Ur< ^ entt 01 mRNA ,eve,s reveal fe remarkable 
?3ZL , Varia,,on in W «>PV number contribute, to 
IhTltx^f^Z'^^J" tUm ° r Ce,k - ^fically. we find 
^ an " J » ifled 9enes show moderately or highly 
eXpreSSion ' * at ONA Wy number Influences gene ex* 
fd^nr 0 " IT'^ ran9e °' DNA «W number alterations 
i 2 ^'^"'"f > '^ nd ^'^'amplification). thaton average, 
a 2-fold change in DNA copy number is associated with a coire- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 

luZlui ^1 " 3 ^°" ,n 9en<? session among the breast 
S ^ direcdy attibuteble to underlying variation in genecopy 

co™^^^ Pr ° Vide eVidence * at widespread DNA 
copy number alteration can lead directly to global deregulation of 

pToo^oT^"* ^ ^ C ° n,ribU,e to ,he O^pmem ol 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 

cation of a number of recurrent regions of DNA copy oumbcr 
alteration m breast cancer cell lines and tumors (2-4). While 
uT fnvvf,?* 0 ™ COnUin known or candidate oncogenes 

il&flnli^ISL* 2 ?* 13 * and tujnor Oppressor genes 

8ri r.m 8 » n ♦ ^ £ ■ nd 17 <|22-24, and loss of 

8p) remain to be identified. A high-resolution genome-wide 
map dehneatmg the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identify 

cancer, in (his study, we have created such a raao usin* 
?'7*** d (5-7) to profile DNA copy number aUera ion 
in a senes o breast cancer cell lines and primary tumors. 
nC2 "* rew,vcd 4 ue f is the extent to which the widespread 
in ^cf y . nUmbCr f hangCS w ™ d othws "»ve identified 
r JZ L™™ aU ,V*P<**ion of *«« Solved 
JhfJS ^ U f G W £ had ««««d mRNA levels in parallel in 
an SmJ™? l J (8 ' V sing * C Same DNA microarra^ we haS 

between DMA* ^a!^ 8 gen ° mic SCalc the ^ionship 
between DNA copy number changes and gene expression. From 

www.pnasu)rgA 3 |/dol/10. t073/pnai.16247 1999 



this analysis we have identified a significant impact of wide- 

S^f\'?L numbcr a,teration °* transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell tines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas^ with more than 50% being lymph node positive. The 
fraction of tumor cells within specimens averaged at least 50% 
Details of mdrvidual tumors have been published (8. 9Y anri 
are summarized in Table 1, which is published as supportine 
information on the PNAS web site, ww.pnas.org. BreastWe? 
eel! lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either usST* Qiaj?cii 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

ONA Labeling and Microarray Hybridizations. Genomic DNA label- 
ing and hybridizations were performed essentially as describee! 

l °r 55?* * ft l V\ s,I *« ht edifications- Two micrograms 
of DNA was labeled m a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly, 'Test** DNA 
(from tumors and cell lines) was f hiorescentiy labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene clusters), the 
reference^ (labeled with Cy3) for each hybridization was nor- 

T ,C . ukocyte DNA from a donor. Tne fabrication 

of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Oata /Uialysls and Map Positions. Hybridized arrays were scanned 
on a GenePrx scanner (Axon Instruments, Foster City. CAY and 
f iuwescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average \or 
fluorescence Tatio for all array elements equal to 0 Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimatinc 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 



Abbreviation; CGH. comparative genomic hybridization. 

Tw^S^! TO^?™* 1 * a<,dr8t ** d at <*P**"«t of Pathology Stanford 
0f "«^CCSR BuD^ng. Room 324SA 269 Camou, Dr^tTfo* 
CA 94J0S-SI 76. E^naH: potUck 10nanford.edu. 
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PNAS | October 1.2002 | vol. 99 | no. 20 | 12W3-129W 




™<^.«l*<<<™tma Pr «dh U man g e^r^ 

5-nearest neighbors) fluorescent, ratios (tefl/r e f w ^ a^!^^" i„^^^!^ ^ P - fr0m 'P 1 "* 1 ™*^ Xqt«r. Moving average (symmetric 
told-ampHBaUon, green lummescence reneoi ZJ32SST ^^Lt^^I^^^ *"* "** * at ^ <"™^ncVr«fl««s 

<opy number profiles across the X chromosome, shown for cell lines containing different numbers of X chromosome aSUml <lata *' ® Enl * r 9 e<1 v1ew °' DNA 



^f^r 8 thc star,i " 8 P 05 "* 00 of 4,16 0<!St "»<J longest match of 
any DNA sequence represented in the corresponding UniGcne 
-0Q) against the "Golden Path" genome 8 assembty 
(http.//genome.ucsc.cdu/; Oct 7, 2000 Freeze). For UniGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence rauos (for all elements representing the same UniGene 
cluster) are reported. For tnRNA measurements, fluorescence 
» .os are "mean-centered" (Le, reported relative to the mea~ 
ratio across the 44 tumor samples). The data set described here 
can be accessed .n its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced. 

SKS? 7 • r0aSt tUmors and 10 breast c ancer cell lines, using 
CDNA m.croarrays containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for details of 

^T ay f ^ rWi f t ^ ns) - To takc m advanta ee of the im- 
proved spattal resolution of array CGH, we ordered (fluores- 

£?h» Z° S /^ 6fi91 accordin 8 » the "Golden 

rath (http://genome.ucsc.edu/) genome assembly of the draft 
human genorne sequences (11). fa so doing, arrayed cDNAs not 
Z S f mse m re P res « n « of potential interest (e.g., 

2nt£u °" c ° 8 t e T "i. ,hin "V""™). but also provide prectee 
genet.clandmarksfor*romosomalregioftsof a mplificationand 
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deletion. Parallel analysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. 16), as we did before 
(7). demonstrated the sensitivity of our method to detect single- 

THJSlJv'J^k™? l f; t 47 ^). * (48.XXXX), or 
2.5-fotd (49.XXXXX) guns (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slighUy underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(big. la), detected m the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower In the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readily iden- 
tifiable. For example,. gains within lq, 8q, I7q, and 20q were 
8 Proportion of breast cancer cell lines/tumors 
(90%/69%, 100%/47%, 100%/60%, and 90%/44%, respective- 
ly), as were losses within lp, 3p, 8p, and 13a .80%/24«, 

with put? ished cytogenetic studies (refc. 2-4?a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the PNAS web site). The total 
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hf^r^ r ° m "; * eart «™I'«« foir tumors. **« carter cell lines and tumors are separately ordered by hierSTcW^to 

C0W num 1 b€r / * a ?9^ 241 gene* present on the microarrays and mapping to ohromosome 8 are ordered b/^o^ng the 
chromosome. Fluorescence rat os (tert/reference) are depicted by a loo, pseudocolor scale (indicated). Selected genes are Indicated with i^eX* fr«f 

of iamptes displaying the ONA copy number change). The map positions for genes of interest that are not represented on the microarray are Indicated bi*e 
ZrSZ^T Represented cm the .array, (b) graphical display of DNA copy number profile for breast cancer cel. line SKBfS Fluorescence ratio* 
(tumor/normal) are plotted on a log, scale for chromosome 8 ger.es, ordered along the chromosome. 



number of genomic alterations (gains and losses) was found to 
be significant^ higher in breast tumors that were high grade (P » 
0.008), consistent with published OGH data (3), estrogen recep- 
tor negative {P - 0.04), and harboring TP53 mutations (P - 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array COH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a), The complexity of amplicdn 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 26). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident m an examination of the pseudocolor representations 
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of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; i.c, a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) arc found associated with compa- 
ratios C,Cvatcd mRNA lcveU (m*"»<«"ei«d fluorescence 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4a). For both the 
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breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
r tests comparing adjacent classes: cell lines, 4 x 10"^, 1 X 10-«* 
5 X 10-*, 1 X 10-* tumors. 1 X 10~«, 1 X V0r*\ 5 X l<r» 
1 x 10' 4 ), A linear regression of the average Jog(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
numbcrwas accompanied by 1,4- and 15-foidchangesinmRNA 
level for the breast cancer cell lines and tumor*, respectively (Fig. 
4a, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4b). 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 



Pollack star. 



corrctatoncoeffideiY 





-i 1 1 1 r- 

-0J5 -0.4 4)2 qjo 0-2 



Mand-yftacfcgraind cutoff 



i J) 

..1 

a 
a 

0.4 Q 



is 



fig. 4. Genome-wide influence of ONA copy number alterations on mRNA ievek (a) For breast cancer cell lines faravl and turner «mni~ k^u 
mean<emered mRNA fluorescence ratio (log, scale) quartOes (box p.ots indicate 25th/soth, .^tn^l^ a^^l^ ^td^^SL™t^ 
^Kae standard errorsofthe^ 
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tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, wc estimate that, overall, about 
7% of all of the observed variation in mRNA levels can be 
explained direcdy by variation in copy number of the altered 
genes (Fig. Ad). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence Intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4^). This stillundoubtedly 
represents a significant underestimate, as the observed variation 
in globaJ gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome^wide, array COH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining ampKcon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes tn DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 
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ceil lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, wc believe our findings arc likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). r 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings. Phillips et aL (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et al. (15) recently reported that in metastatic 
colon tumors only -4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes In breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et al (15) may have 
systematically under-measured gene expression changes. In (his 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unamplifted chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNAcopy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in turaor- 
igencsis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving" geoe(s) 
within amplicons (16*) J, on a large number of samples, In addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients' tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigencsis d?, 18), beyond the amplification of specific 
oncogenes . and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stocMoraetric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression* Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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Abstract 

Background: Colorectal cancer is a common cancer all over the world. Aberrations in the cell 
cycle checkpoints have been shown to be of prognostic significance in colorectal cancer. 

Methods: The expression of cydin D/ t cydin A, histone H3 and K/-67 was examined in 60 colorectal 
cancer cases for co-regulation and impact on overall survival using immunohistochemistry, 
southern blot and in situ hybridization techniques. Immunoreactivity was evaluated semi 
quantitatively by determining the staining index of the studied proteins. 

Results: There was a significant correlation between cydin Dl gene amplification and protein 
overexpression (concordance = 63.6%) and between Kf-67 and the other studied proteins. The 
staining index for Ki-67, cydin A and Dl was higher in large, poorly differentiated tumors. The 
staining index of cydin D I was significantly higher in cases with deeply invasive tumors and nodal 
metastasis. Overexpression of cydin A and D / and amplification of cydin D / were associated with 
reduced overall survival. Multivariate analysis shows that cydin 0 / and A are two independent 
prognostic factors in colorectal cancer patients. 

Conclusions: Loss of cell cycle checkpoints control is common in colorectal cancer. Cydin A and 
D / are superior independent indicators of poor prognosis in colorectal cancer patients. Therefore, 
they may help in predicting the clinical outcome of those patients on an individual basis and could 
be considered important therapeutic targets. 



Background 

Colorectal cancer (CRC) is the third most common cancer 
in Western countries In Egypt, CRC has unique char- 



acteristics that differ from that reported in other countries 
of the western society. It was estimated that 35.6% of the 
Egyptian CRC cases are below 40 years of age and patients 
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usually present with advanced stage, high grade tumors 
that carry more mutations [2]. This uniquely high propor- 
tion of early-onset CRC, the early and continuous expo- 
sure to hazardous environmental agents, the different 
mutational spectrum and the prevalent consanguinity in 
Egypt justify further studies (3). It was proved that most 
cancers result from accumulation of genetic alterations 
involving certain groups of genes, the majority of which 
are cell cycle regulators that either stimulate or inhibit cell 
cycle progression [1]. Cell proliferation allows orderly 
progression through the cell cycle, which is governed by a 
number of proteins including cyclins and cyclin dependent 
kinases [4,5]. The cyclins belong to a superfamily of genes 
whose products complex with various cyc/w-dependent 
kinases (cdks) to regulate transitions through key check- 
points of die cell cycle (6). Abnormalities of several cyclins 
have been reported in different tumor types, implicating, 
in particular, cyclin A t cyclin E and cyclin D [6,7|. 

Cyclin Dl is a Gl cyclin that regulates the transition from 
Gl to S phase since its peak level and maximum activity 
are reached during the Gl phase of the cell cycle. Whereas 



cyclin A is regarded a regulator of the transition to mitosis 
since it reaches its maximum level during the S and G2 
phases (8). The mechanisms likely to activate the onco- 
genic properties of the cyclins include chromosomal trans- 
locations, gene amplification and aberrant protein 
overexpression [7,9]. 

Several studies have shown that, histone H3 mRNA expres- 
sion can be used to identify the S phase fraction (SPF) 
through the in situ hybridization (ISH) technique 1 10,1 1]. 
The level of histone H3 mRNA reaches its peak during the 
S phase and then drops rapidly at the G2 phase [12|. 

In face of the increasing incidence of CRC and its peculiar 
pattern in the Egyptian population, the present study was 
conducted to assess the role of Ki-67 (pan-cell cycle 
marker), cyclin Dl (Gl phase marker), histone H3 mRNA 
(S phase marker), cyclin A (S to G2 phase marker) in CRC. 
The expression level of these markers was correlated to the 
clinicopathologic features and the overall survival of 
patients. 



Table I: Cllnicopathotoglcal features of patients in relation to the staining index (SI) of KI-67, cydin Df, cyclin A, histone H3 

St (mean + SD) 



Variables 


No. of cases 


#0-67 


Cyclin Dl 


Cyclin A 


Wirtone H3 


Sex 












Male 


36 


18.0 ±6.4 


6.7 1 4.3 


12.7157 


10.715.3 


Female 


24 


20.1 ±5.8 


8.8 1 8.4 


10.016.0 


10.7 1S.4 


Age (years) 












250 


41 


11.7 ±6.0* 


5.6 ± 5.2 


10.0 1 5.3 


6.0 1 5.0* 


<S0 


19 


23.8 ± 5.6 


7.7 ± 6.8 


13.615.7 


22.0 1 5.2 


Tumor size (cm) 












<5.0 


33 


12.216.3* 


5.3 ± 3.8* 


11.516.1* 


10.314.9* 


25.0 


27 


30.1 ±6.2 


22.8 1 7.2 


28.6 1 5.6 


24.015.6 


Histology 












Normal 


20 


3.5 ± 2.0* 


0.6 1 0.2* 


2.31 1.1* 


2.2 ± 0.9 


Carcinoma 


60 


30.3 ± 6.2 


24.9 ± 6.3 


27.2 1 5.8 


10.715.3 


Gl 


15 


11.7 ±6.2 


6.6 ± 4.0 


10.0 15.4 


11.414.9 


Gil 


21 


11.815.6 


8.9 ± 3.6 


12.316.5 


7.8 1 5.4 


GUI 


24 


30.0 ± 4.3 


22.0 1 8.1 


27.0 1 4.9 


11.515.4 


Lymph node 












Negative 


33 


19.517.0 


5.4 1 5.3* 


11.916.5 


12.3 1 5.5 


Positive 


27 


21.3 14.9 


20.6 1 6.9 


12.5 15.0 


14.215.0 


Depth of invasion 












m, sm 


17 


20.7 ± 6.7 


3.1 1 3.1* 


11.917.2 


10.415.1 


beyond sm 


43 


21.916.2 


12.416.5 


12.215.6 


10.715.4 


Stage 












1 


6 


20.6 1 6.7 


5.7 1 6.9 


24.2 1 6.9 


U.I ±5.3 


11 


27 


20.8 1 6.9 


5.3 1 4.3 


24.6 1 6.0 


10.4 1 S.7 


III 


12 


22.0 1 5,4 


7.7 t 6.0 


27. 1 15.2 


10.4 1 4.9 


IV 


15 


24.716.1 


11.3 19.6 


27.5 1 5.5 


12.3 16.2 



* p. value < 0.05 (significant) 
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Methods 
Tissue samples 

Paraffin-embedded tumor tissues were obtained from 60 
CRC patients (47 colon and 13 rectal carcinomas) that 
were diagnosed and treated at the National Cancer Insti- 
tute, Cairo, Egypt during the period from January, 1997 to 
June, 2002. CI in icopatho logical data of the studied cases 
are illustrated in table 1. None of the patients received any 
chemotherapy or irradiation prior to surgery. Histological 
diagnosis of all cases was done by 2 independent pathol- 
ogists according to the WHO Histological Classification. 
Tumors were staged according to the TNM staging system 
[13]. The depth of tumor invasion was classified as inva- 
sion of the mucosa including muscularis mucosa (m), 
invasion of the submucosa (sm), or invasion beyond the 
submucosa [8]. Normal colonic tissues were obtained 
from autopsy specimens (n ■ 20) and were used as a con- 
trol. The actual survival rate of the patients was calculated 
from the date of resection to the date of death. 

Immunohistochemistry 

Four micron sections of each normal and tumor specimen 
were cut onto positive-charged slides; air dried overnight, 
de-paraffin ized in xylene, hydrated through a series of 
graded alcohol and washed in distilled water and 0.01 
PBS (pH 7.4). Slides were then processed for IHC as 
described by Handa et al. [8]. using the following anti- 
bodies: Ki-67 (MIB-1, Dako), cyclin A (6E6; Novocastra, 
Newcastle-Upon-Tyne, UK) and cyclin Dl (DCS-6, Dako). 
A case of invasive breast cancer was used as a positive con- 
trol for Ki-67 and cyclin A whereas a case of mantle cell 
lymphoma was used as a control for cyclin Dl. Negative 
controls were obtained by replacing the primary antibody 
by non-immunized rabbit or mouse serum. 

Brown nuclear staining was regarded as a positive result 
for all studied markers. The proportion of positively- 
stained cells and the intensity of staining were scored in 
tumor and normal colorectal mucosal sections at medium 
power (*200). The degree of positive tumor staining (per- 
centage of positive tumor cells in the examined section) 
was scored from 1-6 and the staining intensity was scored 
from 0-6 according to the pattern of staining in the exam- 
ined section. Staining index (SI) was calculated by multi- 
plying the cellularity and staining scores as described by 
Kingetal. [I4j. 

In situ hybridization 

All tumor samples and 5 normal controls were assessed 
for histone H3 mRNA by ISH using the commercially avail- 
able 550 base fluorescein-labeled DNA probe (Dako, 
Carpinteria, CA) as described by Nagao et aL, 1996. This 
probe hybridizes to the whole mRNA transcript of the 
human hi$toneH3 gene including the5' and 3' untrans- 
lated regions. Scoring of histone H3 mRNA was performed 



as for immunohistochemistry, however, hybridization 
signals were detected in the cytoplasm. 

Molecular detection of cyclin Di gene amplification 

High molecular weight DNA was extracted from paraffin- 
embedded tissues of the tumor and normal colorectal 
mucosal samples as previously described [15). The pro- 
portion of neoplastic and normal cells was determined in 
each tumor sample by examining hematoxylin and eosin- 
stained slides obtained from the edge of the specimen 
used for DNA extraction. Tumor samples were evaluated 
for amplification of cyclin Dl if more than 75% of the 
examined sections were formed of neoplastic cells. 
Accordingly, 50 cases were eligible for the analysis. Ten 
micrograms of the extracted DNA was digested with 
EcoRl. DNA from selected cases was also digested with 
BglU and HfndlH. Samples were separated on 0.8% agar- 
ose gels and transferred to Hybond-N membranes (Amer- 
sham Int., Amersham, UK). The membranes were 
hybridized with 50% formamide, 5 x SSC, 5 * Denhardt's, 
500 ng/ml denatured salmon sperm DNA, 10% dextran 
sulphate and 10° cpm/ml of 32 P-labeled PRAD-1 probe for 
24 h. Membranes were washed with 2 * SSC, 0.1% SDS at 
room temperature for 30 min followed by 2 * SSC, 0.1% 
SDS at 60°C for 30 min and 0.1 x SSC, 0.1% SDS at 60°C 
for 1 h. Filters were autoradiographed using an intensify- 
ing screen at -70 °C for 24-72 h. After being stripped free 
of the PRAD-1 probe, the same blots were hybridized with 
32 P-labeled B-actin probe to normalize against possible 
variations in the loading or transfer of DNA. The autora- 
diograms were analyzed using a densitometer. Intensities 
of PRAD-1 /cyclin Dl were normalized to the P-aclin con- 
trol bands. The degree of amplification was calculated 
from these normalized values. Amplification was consid- 
ered when the signal of the tumor band was >2-fold the 
value of the matched normal mucosa [16]. 

Statistical analysis 

The Mann-Whitney non-parametric test was used to com- 
pare the Sis of pairs of subjects whereas the Kruskal-wallis 
was used for categorial data. Correlation between indices 
was performed using a simple linear regression test. The 
Kaplan-Meier method was used to create survival curves 
which were analyzed by the log-rank test. The impact of 
different variables on survival was determined using the 
Cox proportional hazards model, p. values less than 0.05 
were considered significant. 

Results 

The results of IHC are illustrated in figures 1 and 2. In gen- 
eral, the staining index (Sis) of all studied markers was 
higher in carcinomas than in normal colonic mucosal 
samples (p = 0.0001). Normal colorectal mucosa revealed 
positive imunostaining for Ki-67 in the lower half of the 
crypts only. A heterogeneous staining pattern was 
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Figure J 

Normal colonic mucosa showing positive nuclear immunostaining for: (a) cydin D/, (b) ISH of histone H3 mRNA, (c) Kh67 and 
(6)cydinA 



detected in the neoplastic cells of well and moderately-dif- 
ferentiated adenocarcinomas whereas a diffuse homoge- 
neous staining pattern was detected in poorly- 
differentiated carcinomas. The SI ranged from 10-40.2 
(mean: 24.6 ± 6.5). 

Immunostaining for cydin Dl was predominantly nuclear 
but cytoplasmic staining was detected in some cases. 
However, unless a nuclear staining was also detected, 
cases with cytoplasmic staining were considered negative. 
Normal colorectal mucosal samples were almost negative 
for cydin Dl whereas 41 out of the 60 (68.3%) CRC cases 
were positive. Marked heterogeneity was observed in well- 
and moderately-differentiated adenocarcinomas even 
within the same tumor. Poorly-differentiated carcinomas 
revealed a diffuse staining pattern with more darkly- 
stained nuclei. The SI ranged from 0.5-28,6 (mean: 9.3 ± 
4.2). 



Positive nuclear staining for cydin A was detected in 80% 
(48/60) of CRC cases and in all non-neoplastic control 
samples. Positively-stained nuclei were confined to the 
lower half of the crypts in normal colonic mucosa and dif- 
fusely-dispersed in carcinomas. The SI ranged from 3.3- 
30.2 (mean: 15.1 ± 6.6). 

Histone H3 mRNA was intensely expressed in the cyto- 
plasm of all examined samples either neoplastic or non- 
neoplastic. The distribution of histone H3 mRNA was 
similar to that of cydin A and Ki-67 however, the propor- 
tion of histone H3 mRNA positive cells was less than thai 
of Ki67. The SI ranged from 1.8-24.2 (mean: 12.4 ± 5.3). 

The PRAD-1 probe detected 3 Eco RI fragments of 4.0, 2.2 
and 2.0 and 1 Bgfll fragment of 15 Kb. PRAD-Jfcyclw Dl 
gene amplification was detected in 22/50 (44%) cases 
analyzed. The degree of amplification was heterogeneous 
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Figure 2 

A case of well differentiated adenocarcinoma with positive immunostaining for: (a) cydin D/ f (b) histone H3 mRNA, (c) Ki-67, 
and (d) cydin A. Another case of moderately differentiated denocarcinoma with positive immunostaining for; (e) cydin D I , (f) 
histone H3 mRNA, (g) Ki-67, and (h) cyclin A. A case of poorly differentiated adenocarcinoma with diffuse staining for: (i) cycJ/r? 
D/, (j) ISH of histone H3 mRNA. (k) fQ-67 and (I) cydin A. 



with 2-10 fold increase when compared to normal 
mucosal samples (Figure 3). Amplification was confirmed 
by other restriction enzymes. 

Correlations 

There was a significant correlation between cyclin Dl gene 
amplification and protein overexpression. Out of the 22 



cases that showed amplification 14 showed protein over- 
expression (concordance = 63.6%). 

Linear regression analysis of Sis revealed a significant cor- 
relation between Ki-67 and cyclin DL cyclin A, histone H3 
as well as between the Sis of cyclin A and histone H3 (p = 
0.008, 0.000 1, and 0.0001 respectively) (Figure 4). There 
was a significant relationship between the SI of both Ki-67 



Page 5 of 12 
(page number not for citation purposes) 



BMC Gastroenterology 2004, 4:22 



ht^://www.biomedcentral.com/1471-230X/4/22 



NT NT NTNT NTNT NT 



PRAD-1 


■■■ W 










w W 




















B-Actin 


rxa 






Li iJtf 










N 


T 


N T 


N 


T 





15 KG 



B 




B-Actin 



Figure 3 

A: Southern blot analysis of normal mucosa (N) and their seven corresponding cases of colonic adenocarcinomas (TI-T7), 
cases No. I, 2, 4, and 5 are poorly differentiated whereas cases No. 3, 6, and 7 are moderately differentiated. Genomic DNA 
was digested with Bg/ll, fractionated by electrophoresis in agarose gel. transferred onto membranes and hybridized with PRAD / 
and fiactm. Tumors number I-6 (Lanes I-6) show different degrees of PRAD l/cydin 0 / amplification, tumor number 7 (lane 7) 
was not amplified. B: Southern blot analysis of 3 cases of adenocarcinomas (T) and matched normal colonic mucosa (N). 
Genomic DNA was digested with EcoRI, fractionated by electrophoresis in agarose gel, transferred onto membranes and 
hybridized with PRAD I and fi-octin probes for loading control. The identification of the 3 tumors is the same as in Fig. 3 A with 
amplification of PRADUcydin 01 in tumors number 4, 5 (Lanes I, 2) but not 7 (Lane 3). 



and cyclin A and the degree of differentiation of tumors as 
well as the size of the tumor (p < 0.001 and p < 0.01 
respectively). In addition, SI of Ki-67 and histone H3 were 
higher in patients <50 years than in those >50 years (p < 
0.05) (table 1). 



In addition table 2 shows a significant relationship 
between high cyclin Dl SI and large, poorly-differentiated 
tumors, carcinomas with positive lymph node metastasis 
and deeply-invasive carcinomas (p < 0.05, p < 0.001, p < 
0.05 and p < 0.05 respectively). Whereas cyclin Dl gene 
amplification was significantly associated with an 
advanced disease stage since amplification was detected in 
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Figure 4 

Correlation between the staining intensity of (a) Kf-67 vs. cyclin D/, (b) Ki-67 vs. histone H3 t (c) Ki-67 vs. cyclin A and (d) cydin A 
vs. histone H3 mRNA expression. 



10/15 (66.7%) of stage rV tumors compared to 12/45 
(26.7%) of stage I-f II tumors (p = 0.002). Similarly, DNA 
amplification was detected in 60.5% (26/43) of the carci- 
nomas with extensive local invasion (beyond sm) but 
only in 23.5% (4/17) of the carcinomas with limited inva- 
sion (m, sm) (p = 0.001). A significant correlation was 
also present between cydin Dl gene amplification and the 
presence of lymph node metastasis (p = 0.008) as well as 
between the SI of histone H3, the size of the tumor and the 
patient's age (p < 0.05, p < 0.001 respectively). The SI was 
higher in tumors >5 cm in diameter and in patients <50 
years. 

Survival analysis 

The mean follow-up period for all patients was 30 months 
(range: 1-66 months). Eighteen of 60 patients had 
already died by the time the study was completed. We 



defined the cutoff level for overexpression of each cell 
cycle marker at the point that showed the maximum dif- 
ference of survival rate between the 2 groups separated by 
that point. Cox regression analysis revealed that cyclin A 
overexpression (our definition: SI £ 10.5), cyclin Dl over- 
expression (our definition: SI £ 6.1), poorly differentiated 
histology, lymph node metastasis, TNM stage, tumor size 
and depth of invasion were all significant prognostic var- 
iables for survival (Table 3). The Kaplan-Meier survival 
curves for the subgroups of patients who are subdivided 
according to each marker's status are shown in Figure 5. 
Patient with tumors that showed overexpression 
(our definition: SI > 1 1.5) and histone H3 overexpression 
(our definition: SI £ 8.2) tended to have poor prognosis 
but this did not reach a statistically significant level how- 
ever the overall survival was significantly lower in patient 
with cyclin A and cyclin Dl overexpression. Cox multivari- 
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Table 2: The relation between cydin Dt overexpresslon vs cydin Dt amplification and cllnicopat ho logical prognostic markers. 



Variables 


No. of cases Cydin Dl overexpresslon 


Cydin Dt Amplification 


Tumor size (cm) 












33 5.3 ± 3.8* 




13/33 




2S.Q 


27 2Z8 ± 7.2 p <0.05 




9/27 p <0.236 




Histology 








a 


IS 6.6 ±4.0 




7/15 




en 


21 8.9 ±3.6 




8/21 




an 


24 2Z0±8.I p<0.00l 




7/24 p <0.075 




Lymph node 








Negative 


33 5.4 ± 5.3* 




6/33(18.2%) 




Positive 


27 20.6 ± 6.9 p <0.05 




16/27 (59.3%) p<0.008 


Depth of invasion 










m, sm 


17 3.1 ±3.1* 




4/17(23.5%) 




beyond sm 


43 12.4 ± 6.5 p<0.05 




26/43 (60.5%)p<0.<M>J 


Stage 










early 


45 5.5 ±10.1 




12/45 (26.7%) 




tate 


15 1 1.3 ± 9.6 P = 0.(75 




10/15 (66.7%) p«U>02 


Table 3: Uunlvariate analysis of the relationship between survival and the tested markers 


PredictiveVariables Median Survival HR 


a 




P 


K/-67 










<ll.5 36 










SII.5 32 


1.826 


0.636 - 5.243 




0.26 


Cydin Dl 










<6.l 35 










J>6.l (8 


7.246 


1.007-45.150 




0.03* 


Histone H3 










<8.2 35 










£8.2 29 


4.639 


0.854-25.196 




0.07 


Cydin A 










<I0.5 35 










*I0.5 15 


7.820 


1.017-60.122 




0.02* 


Histological grade 










Low 38 










High (0 


7.331 


2.696- 19.940 




0.0001* 


Lymph node 










Negative 38 










Positive 15 


6.826 


1.973-23.621 




0.002* 


Stoge 










1, IK III 38 










/V 12 


6.378 


1.842-22.083 




0.001* 


Tumor size (cm) 










<5.0 35 










£5.0 13 


4.835 


1.386- 16.868 




0.01* 


Depth of invasion 










TI.T2 36 










T3.T4 20 


7.759 


1.024 - 58.789 




0.04* 


Age (years) 










<50 38 










2S0 28 


2.802 


0.988 - 7.943 




0.0526 


Sex 










Mate 38 










Female 36 


0.696 


00.274-1.766 




0.4449 



* p. value < 0.05 (significant) 

HR: Hazard Ratio 

CI: 95% confidence Interval 
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Figure 5 

Kaplan-Meier survival curves for colorectal carcinoma. Overall survival is significantly lower in patients with (a) cydin A and (b) 
cydfn 0/ overexpression. Patients with high SI for historic H3 mRNA have poorer prognosis but this was not statistically signif- 
icant (c). No significant difference was present between patients with high KJ-67 SI and those with low Kh67 SI (d). 



ate regression analysis revealed that lymph node metasta- 
sis, cydin A and cydin Dl overexpression were 
independent negative prognostic factors after adjustment 
for the depth of tumor invasion, age and sex of the patient 
(Table 4). 

Discussion 

The proliferative activity of CRC cells has been investi- 
gated in several studies either by immunohistochemical 
determination of cell proliferation index using antibodies 
to some types of cydins or by flowcyto metric determina- 
tion of the SPF of the cell cycle [8]. Although Leach et al. 
[17] did not find cydin Dl gene amplification in a panel 
of 47 CRC cell lines; its protein was overexpressed in 
about 30% of CRC cases that were included in the studies 



of Bartakova et al. [6] and Arber et al. [18]. In the former 
study \6\cyclin Dl was aberrantly accumulated in a 
significant subset of human CRC cases and the cell lines 
derived from these cases were dependent on cydin in their 
cell cycle progression. In the second study [18], overex- 
pression of cydin Dl was detected in 30% of adenomatous 
polyps indicating that overexpression is a relatively early 
event in colon carcinogenesis which is possibly responsi- 
ble for the pathological changes in the mucosa preceding 
neoplastic transformation. More recently, Holland et al. 
[19], Pasz-Walczak et al. [20] and Utsunomiya et al. [21] 
reported up-regulation of cydin Dl in 58.7%, 100% and 
43% of their studied cases respectively. 
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Table 4; Multivariate analysis ofthe relationship between survival 
and thetested markers 



PredictiveVariables 


HR 


a 


P 


Cyd'mDl 


10.864 


1.055 - 86.250 


0.03* 


(baseline < 6.1) 








Cyclin A 


13.886 


1.012- 190379 


0.0490* 


(baseline < 1 0.5) 




• 




Positive Lymph node 


3.921 


1.057- 14,472 


0.0410* 


metastasis 








Stage IV 


3.411 


1.048-12.083 


0.03* 


Depth of invasion 








T3.T4 


5.408 


0.449-65.080 


0.1836 


Age (years) 








£50 


1.996 


0.678-5^78 


0.2310 


Sex 


0.910 


0.315-2.358 


0.8453 



p. value < 0.05 (significant) 

HR: Hazard Ratio 

CI: 95% confidence Interval 



In the present study, up-regulation of cyclin Dl was 
detected in 68.3% of the cases. The SI was significantly 
higher in carcinomas than in normal colorectal mucosa 
and in poorly-differentiated adenocarcinomas it was 
approximately twice that of other histological types. 
Amplification and/or overexpression of cyclin Dl signifi- 
cantly correlated with deeply invasive tumors and positive 
lymph node metastasis. Our results in this regards are con- 
sistent with previous studies [8,22]. In 2001, Holland et 
al. [19). demonstrated that deregulation of cyclin Dl and 
p21 wa f proteins are important in colorectal tumorigenesis 
and have implications for patient prognosis. Similarly 
McKay etal. [23] found that cyclin D 1 was the only protein 
in their panel (cydin Dl, p53. p!6, Rb-1, PCNA and p27) 
that correlated with improved outcome in CRC pauents. 
However, few studies failed to detect any correlation 
between cyclin Dl overexpression and the 
clinicopathological factors in CRC [6,18). This contro- 
versy in results could partially be explained by the differ- 
ence in the sampling of studied cases. The present study 
included 24 cases of poorly differentiated adenocarci- 
noma, which is not common in other studies of CRC in 
western countries. This was possible because the majority 
of CRC cases diagnosed in Egypt are of high histological 
grade |3). The correlation between cyclin Dl overexpres- 
sion and the high histological grade was also reported in 
other tumor types including non-small cell lung 
carcinomas [24] and squamous cell carcinomas of the lar- 
ynx [16]. Another possible explanation for the observed 
controversy in the results of different studies is the detec- 
tion method used. 

In the present work, overexpression of cyclin Dl was more 
common than gene amplification of the PRAD-1 /cyclin Dl 



gene with a 63.6% concordance. This was similarly 
reported by Bartakova et al. [6] who mentioned that there 
is a subset of CRC cases in which cyclin Dl is overex- 
pressed without PRAD-1 /cyclin Dl gene amplification. 
Consistent with this hypothesis are reports of elevated cyc- 
lin Dl mRNA levels and immunohistochemically detecta- 
ble accumulation of the protein in over one third of breast 
cancer cases at a frequency significantly higher than that 
deduced from DNA amplification studies [9,25], These 
data imply that mechanisms other than gene 
amplification can also lead to deregulation and accumu- 
lation of cyclin Dl in solid tumors. 

So far, several studies were done to reveal the prognostic 
significance of cyclin Dl overexpression in various carci- 
nomas, including CRC [22]. However, these studies 
yielded conflicting results which could be attributed to 
organ heterogeneity. In our study, patients with tumors 
that exhibited cyclin Dl overexpression tended to have 
poor prognosis. 

It was reported that, patients with cyclin A positive carci- 
nomas had significantly shorter median survival times. 
Handa et al. [8] were able to delect cyclin A overexpression 
in 77% of their CRC cases. They also demonstrated that, 
cylcin A could be used as a prognostic factor of CRC. More 
recently, Habermann et al. (26] studied cases of ulcerative 
colitis with and without an associated adenocarcinoma 
for the presence of cyclin A overexpression. They found 
that, cyclin A overexpression was higher in cases of ulcera- 
tive colitis with adenocarcinomas than in those without 
adenocarcinomas. Consequently, they concluded that, 
cyclin A could be used for monitoring ulcerative colitis 
patients and for the early detection of an emerging carci- 
noma in this high risk group of patients. 

In our study, cyclin A was detected in 80% of the patients 
and Cox regression analysis showed that it could be used 
as a prognostic marker in CRC in addition to cyclin Dl . 

It would have been useful if we assessed the expression 
level of cyclin A by another technique (DNA 
amplification). This would have added more information 
regarding the gene status on one hand and confirmed the 
results of 1HC on the other hand. Unfortunately, this was 
not possible because in most of the cases included in the 
present work, the extracted DNA was not sufficient to 
study cyclin amplification after the assessment of cyclin Dl . 

In 1996, Nagao et al. [11] reported that histone H3 labe- 
ling index significantly correlated with ki-67 immunos- 
taining and was high in poorly differentiated human 
hepatocellular carcinoma. This was similarly reported in 
the present work since we found a significant correlation 
between the SI of histone H3 and Ki-67. However, no 
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statistically significant correlation was found between his- 
tone H3 SI and any of the studied clinicopathologicai 
factors. 

Although Ki-67 immunostaining reflects the proliferative 
activity of CRC, it has not been recognized as a significant 
prognostic factor in this type of tumors [27,28]. However, 
Suzuki at al. [29] found a significant correlation between 
Ki-67 labeling index and local invasion of CRC. In the 
present study there was a significant relationship between 
the SI of Ki-67, tumor size and grade. However, Kaplan- 
Meier survival curves showed no significant difference in 
survival rates between patients with- and without overex- 
pression of Ki-67. 

Conclusions 

Our results demonstrate that cydin DL cyclin A, histone H3 
and Ki-67 are overexpressed in a subset of CRC, however 
only cydin Dl and cydin A overexpression correlates with 
poor differentiation and tumor progression. This indi- 
cates the superiority of cyclin A and cydin Dl as indicators 
of poor prognosis compared to Ki-67 and histone H3 
mRNA in CRC. Cydin A and Dl could therefore be consid- 
ered significant, independent prognostic factors in CRC 
patients. These findings are especially important in stage 
II patients since 25-30% of those patients have poor prog- 
nosis in spite of being node-negative. However, the stand- 
ard clinicopathologic prognostic factors can not identify 
this subset accurately and therefore; there is a great 
demand for more accurate, individually-based, biological 
prognostic parameters that help in detecting this high risk 
group of patients who can benefit from an adjuvant ther- 
apy. If the findings of the present study are confirmed in a 
larger study, evaluation of cydin A and DJ maybe applica- 
ble to clinical management of CRC, allowing the identifi- 
cation of patients with poor prognosis. 
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In this study, we analysed gene amplification, RNA expression and protein expression of the c-myc gene on archival tissue specimens 
of high-grade human breast cancer, using fluorescent in situ hybridisation (FISH), nonradioactive in situ hybridisation and 
immunohistochemistry. The specific question that we addressed was whether expression of c-Myc mRNA and protein were 
correlated with its gene copy amplification, as determined by FISH. Although c-Myc is one of the most commonly amplified 
oncogenes in human breast cancer, few studies have utilised in situ approaches to directly analyse the gene copy amplification. RNA 
transcription and protein expression on human breast tumour tissue sections. We now report that by using the sensitive FISH 
technique, a high proportion (70%) of high-grade breast carcinoma were amplified for the c-myc gene, irrespective of status of the 
oestrogen receptor. However, the level of amplification was low. ranging between one and four copies of gene gains, and the majority 
(84%) of the cases with this gene amplification gained only one to two copies. Approximately 92% of the cases were positive for c- 
myc RNA transcription, and essentially all demonstrated c-myc protein expression. In fact a wide range of expression levels were 
detected. Statistically significant correlations were identified among the gene amplification indices, the RNA expression scores and 
protein expression scores c-myc gene amplification, as detected by FISH, was significantly associated with expression of its mRNA. as 
measured by the intensity of in situ hybridisation in invasive cells (P = 0.0067), and by the percentage of invasive cells positive for 
mRNA expression (P = 0.0006). c-myc gene amplification was also correlated with the percentage of tumour cells which expressed 
high levels of its protein, as detected by immunohistochemistry in invasive cells (P = 0.001 6). Thus, although multiple mechanisms are 
known to regulate normal and aberrent expression of c-myc, in this study, where in situ methodologies were used to evaluate high- 
grade human breast cancers, gene amplification of c-myc appears to play a key role in regulating expression of its mRNA and protein. 
British Journal of Cancer (2004) 90, 1612- 1619. doi: 1 0. 1 038/sj.bjc.660 1 703 www.bjcancer.com 
Published online 30 March 2004 
© 2004 Cancer Research UK 
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The c-mj»c oncogene has been shown to be amplified and/or 
overexpressed in many types of human cancer (Marcu et al, 1992; 
Nass and Dickson, 1997; Nesbit et al, 1999; Liao and Dickson, 
2000). Numerous experiments in vivo have also causally linked 
aberrant expression of this gene to the development and 
progression of cancer in different body sites (Marcu et al, 1992; 
Nass and Dickson, 1997; Nesbit et al, 1999; Liao and Dickson, 
2000). However, several critical issues regarding the significance of 
c-myc in human cancer still remain obscure, First, even for a given 
type of malignancy, the frequencies of the alterations of c-myc at 
the cytogenetic and expression levels vary gready from one report 
to another (Liao and Dickson, 2000). For instance, the frequencies 
of its amplification, mRNA and protein overexpression in breast 
cancer vary between 1-94, 22-95 and roughly 50-100%, 
respectively, among different reports (Liao and Dickson, 2000). 
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Thus, it is still unclear to what extent this gene is altered at the 
cytogenetic level and at different expression levels in breast 
carcinoma. 

One controversial issue pertains to the prognostic value of c-myc 
gene alterations in cancer. The central role of c-Myc protein in 
accelerating cell proliferation, documented by many early studies, 
has led to a general concept for many types of cancer that 
amplification or overexpression of this gene may be associated 
with a more aggressive tumour and a poorer patient survival 
(Berns el al % 1992; Marcu et al, 1992; Sato et at, 1995; Nass and 
Dickson, 1997; Nesbit et at, 1999; Visca et a/, 1999; Liao and 
Dickson, 2000), However, many reports have shown an opposite 
correlation (Sikora et al, 1985, 1987; Watson et al, 1986; Polacarz 
et al, 1989; Voravud et al, 1989; Williams et al, 1990; Melhem et al, 
1992; Pietilainen et al, 1995; Diebold et al, 1996; Smith and Goh, 
1996; Augenlich et al, 1997; Bieche el al, 1999), while other studies 
do not support either of these conclusions. For instance, gene 
amplification or overexpression of c-Myc protein has also been 
shown to associate with a better tumour differentiation or a better 
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patient survival for cancer of the testis, ovary, bile ducts, colon and 
breast (Sikora et al, 1985, 1987; Watson et al, 1986; Polacarz et al, 
1989; Voravud et al, 1989; Williams et al, 1990; Melhem et al, 1992; 
Pietilainen et al, 1995; Diebold et al, 1996; Smith and Goh, 1996; 
Augenlich et al, 1997; Bieche et al, 1999). This controversy does 
not appear to be related completely to the cancer type, since both 
positive (Berns et al, 1992; Visca et al, 1999) and negative 
(Williams et al, 1990; Melhem et al, 1992; Pietilainen et al, 1995; 
Smith and Goh, 1996; Augenlich et al, 1997; Bieche et al, 1999) 
correlations have been reported for colon cancer and breast 
cancer. More interestingly, c-Myc overexpression has been shown 
to predict a poorer prognosis for cutaneous melanoma, but a 
favourable outcome for uveal melanoma (Grover et al, 1997; Chana 
et al, 1998a, b, 1999; Grover et al, 1999). These data indicate 
different roles of c-Myc, even in the same type of tumour, perhaps 
depending upon different tissue microenvironments. 

Another controversial issue concerns the nuclear -cytoplasmic 
localisation of c-Myc. Studies of neoplasms of the colon, testis, 
ovary and liver have shown that predominandy nuclear localisa- 
tion of c-Myc tends to occur in benign lesions, while cytoplasmic 
localisation tends to occur in more malignant tumours (Sikora 
et al, 1985; Sundaresan et al, 1987; Melhem et al, 1992; Sasano et al, 
1992; Yuen et al, 2001), Whether these patterns of subcellular 
localisation of c-Myc tend to reflect the malignant status of breast 
cancer remains an enigma. 

A recent study of the impact of DNA amplification on gene 
expression patterns in breast cancer used mRNA and DNA from 14 
breast cancer cell lines. Analysis was conducted with a 13000 
cDNA clone array for gene expression measurement and a 
Comparative Genomic Hybridisation (CGH) microarray for gene 
copy number measurements. This study also included known 
breast cancer genes, such as c-myc, HER/2-neu and aibl (Hyman 
et al, 2002). Interestingly, 44% of the most highly amplified genes 
were also overexpressed at the mRNA level. Consistent with this 
pattern, c-Myc gene copy number and its expression levels showed 
a statistically significant (a = 0.020) correlation in this microarray 
study of breast cancer cell lines. Another study, by Pollack and 
colleagues, used microarray analysis and BAC array CGH of RNA 
and DNA (respectively) extracted from intermediate grade human 
breast tissues, and tested for amplification and expression of c- 
Myc (among other genes). This study demonstrated that two out of 
37 specimens were both amplified and overexpressed, while others 
were either amplified or overexpressed, but not both. The authors 
of this study suggested that contaminating stromal tissue may 
compress the fluorescence ratios leading to underestimates of gene 
amplification and overexpression (Pollack et al, 2002). 

To more clearly address the importance of gene amplification 
and expression of c-Myc in human breast cancer, we used in situ 
methodologies, which can clearly distinguish stromal and carci- 
noma components. We studied the amplification and over 
expression of the c-myc gene with fluorescent in situ hybridisation 
(FISH), non-radioactive in situ hybridisation (ISH) and immuno- 
histochemical (IHC) approaches on paraffin-embedded biopsy 
sections of untreated, high-grade breast cancer. It was observed 
that 70, 92 and 70% of the cancer cases exhibited c-myc gene 
amplification, its mRNA overexpression and its protein over 
expression, respectively. In most of the cases (84%) that showed 
gene amplification, the c-myc gene gained only one to two copies, 
which is consistent with c-myc FISH data from other studies. 
Unlike some oncogenes, such as N-myc, which typically demon- 
strates gene amplification copy numbers of greater then 10 in 
neuroblastoma, and HER-2/neu (Sartelet et al, 2002), whose copy 
numbers range up to 14-40 in breast carcinomas (Isola et al, 
1999), gene copy numbers of c-myc are not as greatly increased. In 
the study noted earlier, using breast cancer cell line CGH array and 
cDNA microarray expression analysis, it was demonstrated that 
the most dramatically increased expression levels were associated 
with large gene copy number increases, although low-level gains 
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and losses had a significant influence on gene expression 
dysregulation (Hyman et al, 2002). Only one study has been 
published (Pollack et al, 2002) that has begun to determine if these 
findings are directly relevant to actual human breast tumour 
tissues, since many of the genetic changes in tissue culture cell 
lines are more extreme than those displayed in primary tumour 
material. Furthermore, the relationships among gene amplifica- 
tion, mRNA expression and c-Myc protein expression were not 
explored in prior human breast cancer cell line and tumour tissue 
studies (Hyman et al, 2002; Pollack et al, 2002). 

In our human breast tumour tissue study, a high correlation was 
found between c-myc FISH and ISH, for both percentage of 
staining (P<0.0067) and intensity positive cells (P<0.0006). In 
addition, c-myc gene copy amplification by FISH was correlated 
with c-Myc protein expression positive cells by IHC (P< 0.0016). 
These results support the idea that c-Myc overexpression of both 
mRNA and protein is related to the copy number of the c-myc 
DNA amplification. We show in this study that amplification and 
overexpression of c-Myc occur with high frequency in high-grade 
human breast cancer tissues. 



MATERIALS AND METHODS 
Materials 

Formalin-fixed, paraffin-embedded tissue blocks of breast carci- 
noma and normal breast tissue were obtained from the 
Histopathology and Tissue Shared Resource at the Lombard i 
Comprehensive Cancer Center (LCCC), at Georgetown University 
Medical Center. The criteria for tumour selection were the 
following: negative progesterone receptor status, metastases to 
auxiliary lymph nodes and high grade (Elston Score >7). The 
oestrogen receptor status of the tumours was known from archived 
pathology reports. The parameters were chosen from our prior 
meta-analysis (Deming et al, 2000), as indications of a high 
likelihood of c-myc gene amplification. Normal breast tissue 
specimens were from reduction mammoplasty. Serial sections 
(5/tm) for FISH, ISH and IHC were prepared by the LCCC 
Histopathology and Tissue Shared Resource. 

FISH 

A dual-label FISH technique was used (Jenkins et al, 1997). Slides 
were baked overnight at 60°C to assure adherence of the sample. 
Tissue sections were deparaffinised with two successive, lOmin 
xylene washes, and then dehydrated in a graded ethanol series of 
70, 80 and 95% at room temperature. Samples were then digested 
with 4% pepsin (Sigma, St Louis, MO, USA) at 45°C for lOmin. 
DNA probes used were an alpha satellite probe to chromosome 8, 
labelled with biotin, and a c-myc probe, labelled with digoxigenin 
(Ventana, Tucson, AZ, USA). Codenaturation was performed at 
90° for lOmin on a hot plate. Hybridisation was at 37°C for 12- 
16h. Detection of signals was accomplished with an antiavidin 
antibody labelled with Texas Red, and an an ti digoxigenin antibody 
conjugated to fluorescein (Ventana, Tucson, AZ, USA). Slides were 
postwashed in 2 x SSC at 72°C for 5 min and counterstained with 
DAPI to visualise cell nuclei. Results were viewed and quantified 
with a Zeiss Axiophot fluorescence microscope, equipped with 
appropriate filters and an Applied Imaging Cytovision system 
(Pittsburgh, PA, USA). In this approach, the c-myc unique 
sequence probe was visualised as a green signal and the control 
probe for the chromosome 8 centromere was red, thus easily being 
distinguished when scored. 

One serial section from each tumour sample was stained with 
haematoxylin and eosin and first reviewed by a pathologist (BS), to 
help identify the tumour area of the section. This procedure 
ensured that the tumour cells, but not the normal cells, were 
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counted. Nuclei of up to 50 tumour cells were scored from each 
FISH-stained section, independently by two investigators. Hybri- 
disation signals were averaged, and the amplification index was 
presented as the number of c-myc signals divided by the number of 
chromosome 8 centromere signals. A 1.8 -fold increase was used as 
the criterion to judge the presence of c-myc gene amplification. 

In situ hybridisation 

In situ hybridisation (ISH) was carried out with a nonradioactive 
method, described previously (Liao et al, 2000a, b). One serial 
section from each specimen was hybridised overnight at 60°C with 
riboprobes, that were in vitro transcribed from the antisense or 
sense strand of an approximately 300 bp cDNA of human c-myc 
(ATCC, Manassas, VA, USA), labelled with digoxigenin-conjugated 
UTP. The sections were then incubated with an antibody against 
digoxigenin, followed by incubation with a second antibody 
conjugated to alkaline phosphatase. The signal was visualised by 
colour development with 5 bromo-4-chloro-3-indoIyl phosphate 
and rutroblue tetrazolium. All reagents were purchased from 
Boehringer Mannheim, Indianapolis, IA. To control the signal 
specificity, two serial sections were mounted on the same slide for 
hybridisation with the antisense and sense probes, respectively. 
ISH was given an intensity and percentage scores, based on 
intensity of positive staining and number of cells staining, 
respectively. Intensity scores were assigned 0, 1, 2 and 3, and 
percentage scores were assigned as 1- 1-25, 2- 26-50, 3- 51-75 
and 4. 76-100%. 

Immunohistochemistry 

Immunohistochemical staining (IHC) was performed using an 
avidin-biotin complex (ABC) method described previously (Liao 
et al, 1998). One serial section of each specimen was deparaffinised 
and blocked with 3% peroxide. Antigens were retrieved by heating 
slides in a microwave oven in 50 mM citrate buffer, pH 6.4, at 
boiling temperature, for 12min. After blocking with 6% normal 
goat serum, the section was incubated with a mouse monoclonal 
antibody to human c-Myc (9£10, Sigma Chemical Company, St 
Louis, MO, USA) at 1:100 dilution for 2h, followed by Ih 
incubation with a second antibody conjugated with biotin (Vector 
Laboratories Inc., Burlingame, CA, USA). The section was then 
incubated with peroxidase-conjugated avidin (Dako, Corporation, 
Carpinteria, CA, USA) for 30min, followed by colour development 
with diaminobenzidine and peroxide. All procedures were carried 
out at room temperature. To control the signal specificity, serial 
sections from 10 tumour samples were also stained using an 
alternate c-Myc antibody (CI 9 from Santa Cruz Biotechnology 
Inc., Santa Cruz, CA, USA) at 1 : 60 dilution. This antibody resulted 
in focally positive staining in the tumour, but the staining intensity 
was weaker. To control die signal specificity, serial sections were 
made from five selected positive cases which were subjected to the 
same staining procedure, with a normal mouse IgG to replace the 
c-Myc antibody. This control staining did not give rise to a signal, 
demonstrating the specificity of the c-Myc antibody signal. IHC 
staining was given an intensity and percentage score based upon 
the intensity of positive staining and number of cells staining. 
Intensity scores were assigned 0, 1, 2 and 3 and percentage scores 
were assigned as 1- 1-25, 2- 26-50, 3- 51-75 and 4- 76-100%. 
Determinations were made of cellular localisation of c-Myc 
antibody staining to cytoplasm and/or nucleus in normal and 
invasive cells within each breast tumour specimen. 

Statistical analyses 

For each analysis of gene copy amplification (FISH), mRNA 
expression (ISH) and protein expression (IHC), all cases were first 
grouped as positive or negative to calculate the percentages of 



positive cases and negative cases, as described (Zar, 1974). Fisher's 
exact test was used to compare percentages, and two-sample i-test 
or Wilcoxon rank test was used to compare average scores. Both 
ISH and IHC were given intensity and percentage scores, based on 
intensity of positive staining and number of cells staining, 
respectively. As noted earlier, intensity scores were assigned 0, 1, 
2 and 3 and percentage scores were assigned as 1- 1 - 25, 2- 26-50, 
3- 51-75 and 4- 76-100%. A score of >2 for either intensity of 
staining or percentage of cells positive by ISH was assigned as 
high. For IHC, an intensity score of > 1 was assigned as high and a 
percentage score of >3 was categorised as high. Each amplifica- 
tion index was paired with its corresponding mRNA expression 
score to calculate the coefficient r. The same method was used to 
estimate the association of the amplification indices with the c- 
Myc protein expression levels, and the association of the mRNA 
expression levels with the protein expression levels. A F-value of 
0.05 or less was used to determine the statistical significance in all 
analyses. In all, 54 pairs of normal vs invasive tissues were analysed 
using McNemars x 2 test to determine if there was a difference in 
cellular localisation of c-Myc antibody signal to nuclear or 
cytoplasmic compartments. 



RESULTS 

FISH analysis of gene amplification 

Amplification of the c-myc gene was measured by a FISH test in 46 
cases of breast cancer; Figure 1 demonstrates cells with no 
amplification (one copy of c-myc /one copy of chromosome 8 
centromere, and a moderate amplification a 3/1 ratio). Amplifica- 
tion was calculated by the number of c-myc signals divided by the 
number of chromosome 8 alpha satellite signals. A 1.8-fold 
increase cut-off was used to judge gene amplification. As shown 
in Table 1, 32 out of 46 (70%) cases were gene amplified for c-myc, 
whereas only 30% (14/46) of the cases showed amplification 
indices lower than the cut-off value. The amplification indices for 
most (84%, or 27/32) cases with gene amplification, ranged 
between 1.8- and three-fold, indicating that the locus gained up to 
two copies of c-myc in the majority of the cases. The percentage of 
cases with gene gains of three copies or higher was 1 1% (five out of 
46) of total cases analysed, or near 16% (five out of 32) of the cases 
with gene amplification, including one case (2% of total cases or 
3% of the cases with gene amplification) with the highest index of 5 
(a gain of four copies). 

In all, 28 of the breast carcinomas in this study were ER 
negative, and 14 were ER positive. The average c-myc gene 
amplification score was 1.896 (s.e. = 0.196) for ER positive and 
2.201 (s.e. = 0.157) for ER negative. Although ER-negative tumours 
had a slightly higher average c-myc score, the difference was not 
statistically significant (two-sided P = 0.252 from two-sample /-test 
and 0.251 from Wilcoxon rank test), consistent with the results of 
our prior meta-analysis of the literature (Deming et aU 2000). 



In situ hybridisation analysis of c-myc mRNA expression 

A total of 51 breast cancer samples were studied for c-Myc mRNA 
expression, with non radioactive in situ hybridisation (ISH). ISH 
results were assigned intensity and percentage scores based upon 
signal intensity of positive staining and number of cells staining 
within the sample, respectively. As shown in Table 2, 86% (44 out 
of 51) tumours were scored as high in intensity, and 92% (47out of 
51) had more than 51% positive cells, also considered as highly 
increased c-Myc expression. mRNA expression was heterogeneous 
in the breast tumour tissue, and no morphologic subtype was 
predominant in the high or low categories. One case showed no c- 
Myc ISH staining. In 79% (38/48) of cases, epithelia in normal 
mammary glands adjacent to the tumour also showed a high 
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Figure I RSH analysis of c-myc amplification in tumour cells from breast tumour tissue sections. FISH probe for human c-myc unique-sequence is seen as 
green, while the normal control signal, a centromeric probe signal for chromosome 8 is shown in red. The nuclei of tumour cells were visualised by DAPI 
counter-staining, (A) 1:1 copy ratio of c-myc to chromosome 8 (c-myc/8 centromere), indicating no amplification of c-myc in tumour cells. (B) 1:3 copy 
ratio of c-myc to chromosome 8 (c-myc/B centromere), a moderate amplification of the c-myc gene. 



Table I c-myc gene copy amplification analysis by RSH in poor 
prognosis human breast tumour samples 



Amplification index 




(#c-myc signals/* 


Percentage of samples with 


control signals)* 


FISH ratios in each category 


1.0-1.7 


30% 14 out of 46 


1.8-1.99 


20% Nine out of 46 


2.0-2.9 


39% 18 out of 46 


>3.0 


11% Five out of 46 



Analysis was conducted on 46 individual paraffin-embedded tissue samples with 
negative progesterone receptor status, positive lymph node involvement and high 
tumour grade. + Normal control ratio is I. 



intensity of staining. In three cases, no staining was seen in the 
normal terminal duct lobular units. Figure 2 shows representative 
fields of high, medium and low c-myc mRNA expression levels in 
invasive ductal carcinoma samples. 

Association of FISH and ISH 

c-Myc scores were dichotomised as binary variables (high or low), 
and a score of 2 or higher was categorised as high on ISH. A score 
higher than median was categorised as high from FISH studies. 
These dichotomised scores are depicted in Table 3. A Fisher's exact 
test was performed for comparing binary responses to see if there 
was any association between FISH and ISH. It was found that the 
FISH score was significantly associated with percentage of staining 
in the invasive cells (P = 0,0067, two-sided McNemar's test) and 
also with the intensity score on ISH (P = 0.0006, two-sided). 



Immunohistochemical staining of c-Myc proteins 

In total, 51 breast carcinomas, which were subjected to FISH 
analysis, and all of which also had been analysed for c-myc mRNA 
by in situ hybridisation, were also analysed for the expression of c- 
Myc protein, using immunohistochemical staining with the 9E10 
antibody. IHC results were assigned an intensity and percentage 
score based on intensity of positive staining and number of cells 
staining, respectively. Intensity scores were assigned 0, 1, 2 and 3 
and percentage scores were assigned as 0, 1- 0-25, 2- 26-50, 3- 



51-75 and 4- 76-100. For IHC, an intensity score of >l was 
assigned as high and a percentage score of > 3 was categorised as 
high. Figure 2 shows examples of high, medium and low levels of c- 
myc antibody staining in invasive ductal carcinoma samples. In 34 
cases, normal tissue was seen; 30 of these showed cytoplasmic 
staining and 22 had nuclear staining in terminal ductal lobular 
units. In all, 12 cases showed 1 + , 14 cases 2 + and four cases 3 + 
cytoplasmic staining. In situ hybridisation revealed positive 
staining in 46 out of 49 cases with normal tissue. Seven cases 
showed I+, 13 cases showed 2-1- and 26 cases showed 3 + 
staining by ISH. Both immunohistochemistry and in situ 
hybridisation showed diffuse positivity in adipocytes. 

Table 4 shows the staining pattern for the cohort In all, 70% (36 
out of 51) of cases showed high intensity of staining for c-Myc 
protein, while 85% (29 out of 34) of cases with detectable staining 
had more than 76% positive cells, also considered as high 
expression. To verify the staining specificity, serial sections from 
10 tumour specimens that were positive for 9E10 antibody were 
also stained using the CI 9 rabbit polyclonal anti-c-Myc antibody. 
Results revealed a staining pattern similar to 9E10. However, the 
staining intensity with CI 9 was weaker than 9E10. The specificity 
of these two antibodies was verified by Western blots in previous 
studies (Persons et al> 1997; Liao et al t 2000b). Figure 2 shows 
results of c-Myc in situ hybridisation and immunohistochemistry 
studies on samples considered to demonstrate low, moderate and 
high levels of c-Myc expression. Analysis of c-Myc protein 
localisation results in the nucleus or cytoplasmic compartments 
of normal and invasive cells within the tumours revealed that 
nuclear staining was positive in 41% of normal cells, compared to 
22% of invasive cells (statistical significance at P = 0.01 by 
McNemar's two-sided % 2 test). The increase in relative cytoplasmic 
localisation of c-Myc protein, comparing normal (53.7%), to 
invasive cells (61.1%) was not significantly different. Thus, the data 
are consistent with partial exclusion of c-Myc from the nuclei of 
invasive breast cancer cells. 

The FISH score was significantly associated with the percentage 
positivity of invasive cells, as seen on IHC studies of c-Myc. 
However, 40% of tumours displayed a low index of c-myc gene 
amplification, but still expressed high levels of c-Myc protein 
(Table 6), indicating the possibility of other mechanisms of over 
expression unrelated to gene amplification in at least some 
tumours. The FISH score was not significantly associated with 
the intensity of IHC staining in the invasive cells (not shown), in 
contrast to the IHC percentage positivity score. 
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Table 2 c-myc mRNA in situ hybridisation (I5H) results 



Staining intensity 0 I 2 3 Percent posltivity 12 3 4 

Number of tumour samples in each category N= 5! I 6 25 19 Number of tumour samples in each level category N- 51 I 3 5 42 



In all. 51 human high-grade breast carcinomas were analysed to determine the relationships between c-Myc mRNA expression and c-myc gene in situ hybridisation results. Data 
are shown in two ways in the above table. First overall staining intensity of c-Myc-positive cells was scored as 0, I , 2 3 (tow to high), and the number of tumour samples at each 
level of staining indicated on the line below. Next the percentage of tumour cells staining was scored as 0. 1. 2. 3. A (low to high %. as discussed in Materials and Methods). The 
number of tumours at each lewd of percent cell positivity for c-Myc is then indicated on the line below. 




PS 
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Figure 2 Immunohistochemical staining and in situ hybridisation for c-Myc of three sets of invasive ductal carcinoma. (A. C and E) High (3 + ). 
intermediate (2 + ) and low (I -J- ) level of staining by immunohistochemistry for c-Myc (B, D and F) High (3 + ). intermediate (2 + ) and low ( I 4- ) level of 
staining by in situ hybridisation. 



DISCUSSION 

Although there have been many reports on c-myc amplification in 
human breast cancer (Liao and Dickson, 2000), there are only two 
published studies involving application of the FISH technique to 
unfixed, frozen sections (Persons et al, 1997; Visscher et al, 1997), 
and one prior study using FISH on an archival human tissue 
microarray (Schraml et al, 1999). Another recent study applied 
FISH to evaluate c-myc amplification in ductal carcinoma in situ 



(DCIS) (Aulmann et al, 2002). Using the FISH technique on 
formalin-fixed, paraffin-embedded sections, we now show that 
70% of high-grade breast cancer samples bear c-myc gene copy 
amplifications. Interestingly, the above-mentioned study, using 
FISH and focusing on DCIS, detected amplification of c-myc in 
only 20% of cases, but found a correlation of c-myc with increased 
tumour size and proliferation (Aulmann et al, 2002). 

The level of amplification of c-myc in our study ranged between 
one and four additional copies of the gene; the majority (84%) of 
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the cases with the gene amplification gained only one to two 
copies, also consistent with FISH data reported for c-myc copy 
amplification in human metastatic prostate carcinoma tissues 
(Jenkins et ah 1997). The relationship between the level of c-myc 
gene copy amplification and the level its increased mRNA 
expression has been examined previously in breast cancer cell 
lines (Hyman et ah 2002). In general, it has been concluded that 
the two scores coordinate for c-myc, as is the case for many breast 
cancer genes. However, only 44% of the highly amplified genes, in 
general, showed increased RNA expression, and only 10.5% of the 
highly overexpressed genes were gene copy-amplified in the cell 
line study (Hyman et ah 2002). Another analysis was conducted to 
study of relationships between gene amplification and expression 
of 6095 genes in 37 intermediate grade human breast tumours. 
This study demonstrated that 62% of the highly amplified genes 
also showed elevated expression; overall, a two-fold change in 
DNA copy number was associated with a 1.5-fold change in mRNA 
levels. Overall, 12% of the variation in gene expression in the 
breast rumours studied was associated with gene copy number 
variation (Pollack et ah 2002), Further study of additional human 
breast tumours, at precisely defined grades and stages, will be 
necessary in order to more fully define the relationships between 
DNA copy numbers and expression of genes. The studies we report 
here indicate higher levels of c-Myc gene amplification and 
expression, than other previous reports in breast cancer. We 
believe that this is probably the result of our analysis of individual 
tumour cells in a well-defined set of high-grade breast tumours. 
Prior c-Myc expression and amplification microarray studies used 
tumour specimens which contain normal stromal components, 



Table 3 Correlations between c-myc gene copy number (RSH) mRNA 
expression (ISH) 



FISH 



potentially underestimating amplification and expression levels of 
the invasive tumour components (Pollack et ah 2002). 

Our study reports a percentage of tumours gene amplified for c- 
myc (using FISH in high-grade tumours) that is much higher than 
the average figure (15.5%) reported in the literature (Isola et ah 
2002). Most of the prior studies have employed the relatively 
insensitive Southern blot technique, and were reviewed in a 
recent meta-analysis (Deming et ah 2000). Consistent with this 
prior literature background, a recent study of 94 lobular and 
ductal breast cancers assessed amplification of c-myc by 
using a semiquantitative PCR assay and protein expression, with 



Table 5 Nuclear/cytoplasmic localisation of c-Myc comparing normal 
and invasive cells 



Normal cells 
(frequency percent) 


Invasive cells 
(frequency percent) 




Total 


(A) Nuclear localisation 




+ 






28 




32 


+ 


14 


8 


22 


Total 


42 


12 


54 


(B) Cytoplasmic localisation 










12 


13 


25 




9 


20 


21 


Total 


21 


33 


S4 



In all, 54 pairs (normal vs invasive) of tissues were analysed to answer the questions of 
(I) whether positivity of nuclear ceBs in normal tissues is different from that in 
invasive cells, and similarly (2) whether positivity of cytoplasmic cells in normal tissues 
is different from that in invasive cells. The data are summarised in the above 
contingency tables. In all 22 normal cell specimens were positive for c-Myc staining 
(40.71%). compared to 12 specimens (22.2%) in invasive cells. The difference is 
statistically significant (P = O0l) by McNemar's x* test (two-sided). 



Low 



High 



(A) ISH {% cells) 

Low 

High 



(B) ISH (intensity) 

Low 

High 



/ 

19 



2 

18 



3 

/8 

P = 0.0067 



5 
16 

P = 0.0006 



Serial sections of high-grade human breast carcinomas were scored for c-myc gene 
copy number (FISH, Table I) and mRNA expression (ISH, Table 2). In (A), a positive 
correlation (P - 0,0067) was observed between tumour samples with a high 
percentage of cells demonstrating mRNA expression and a high c-myc gene copy 
number. A score of 2 or higher was classified as high on ISH. and a score of median or 
greater was categorised as high on FISH. In (B), a positive correlation (P = 0,0006) 
was shown between a high level of intensity for c-Myc RNA expression a high and c- 
myc gene copy number. Note that a pairwise comparison of FISH and ISH was not 
possible for all cases, due to incomplete overlap of cases analysed with each assay. 



Table 6 Correlation between c-Myc protein expression (IHC) and c- 
myc gene copy number (RSH) 



IHC (% cells) 




FISH 




Low 




High 


Low 


3 




0 


High 


10 




15 








P = 0.0016 



Consecutive serial sections of high-grade human breast tumours were scored for c- 
vnyc gene copy number or protein expression, by immunohistochemistry (IHC). IHC 
scores were defined in the Materials and methods section. Data were analysed for 
correlations between the results. A highly significant correlation was observed 
between high c-Myc protein expression (IHC) between percent cells positive and 
high c-myc gene amplification (FISH). P = 0.001 6 from two-sided McNemars test. 
Note that for 15 cases, no staining for c-Myc could be detected: these negative cases 
were not included in the correlation presented, above. 



Table 4 c-Myc immunohistochemistry (IHC) results 



Staining intensity 


0 


1 


2 


3 


Percent positivity 


I 


2 


3 


4 


Number of tumour samples in each category 


15 


13 


20 


3 


Number of tumour samples in each category 


2 


2 


1 


29 



In all. 5 1 high-grade human breast carcinomas were analysed to determine the relationships between c-Myc protein expression and c-myc gene in situ hybndisation results. Data 
are shown in two ways in the above table. First overall staining intensity of c-Myc-positive cells was scored as 0, 1 . 2. 3 (low to high), and the number of tumour samples at each 
level of staining is indicated on the line below. Next, in a random subset of these cases, the percentage of tumour cells staining was scored as 0. I. 2, 3. 4 (low to high %, as 
discussed in Matenals and methods). The number of tumours at each level of percent cell positivity for c-Myc is indicated on the line below. 
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densitometry, after Western blot. These data showed c-myc gene 
amplification in 21% of tumours (Jenkins et a/, 1997), using assays 
not based on in situ discrimination of tumour vs nontumour cells. 
The lower frequency of c-myc in this prior study is in contrast with 
the data we present here, and could be the result of the higher 
sensitivity and precision of the FISH and immuiiohistochemical 
methods, as distinct from quantitative PCR and Western blot 
densitometry. In addition, the 70% of amplified tumours in our 
study is also much higher than the 12% reported by Schraml et al 
(1999), using a c-myc FISH test on a tissue microarray. This large 
difference may be because the arrays are prepared from cores of 
paraffin- embedded tissue, as small as 0.6 mm in diameter which 
may contain too few tumour cells for complete analysis of 
amplification of a gene, such as c-myc. c-myc is known to be 
quite heterogeneous in its gene amplification within individual 
tumours (in contrast to HER2/neu t for example) (Persons et al, 
1997). 

Most previous reports on the expression of c-myc mRNA have 
utilised Northern blot, dot blot or PCR-based approaches, while 
just a few involved in situ hybridisation, which were primarily 
performed on frozen tissue sections (Liao and Dickson, 2000). 
Normal breast tissue is dominated by adipose cells, differing 
greatly from tumour tissue in its epithelial cellularity. Thus, 
normal and tumour tissues may not be rigorously compared by 
techniques involving RNA extraction from total tissue. Therefore, 
conclusions such as 'increased expression 1 may be more difficult to 
make from studies with Northern blot, dot blot and PCR-based 
techniques that require RNA extraction from tissues that have not 
been fastidiously micro-dissected for selection of tumour cells. 
Using a more sensitive, nonradioactive in situ hybridisation (ISH) 
approach on formalin-fixed, paraffin-embedded sections, we 
report herein high expression of c-myc mRNA in 92% of high- 
grade breast carcinomas. This figure is much higher than the 
recendy reported data (22%), obtained by using a real-time RT- 
PCR method (Bieche et al, 1999). Dilution of the RNA from 
epithelium by the RNA from adipose in normal breast tissue in this 
latest prior report may be one of the possible explanations for this 
large difference. 

In conclusion, the present study shows that approximately 70, 92 
and 70% of biopsies of untreated high-grade breast cancer exhibit 
c-myc gene amplification, mRNA overexpression and protein 
overexpression, respectively. In most cases (84%), with gene copy 



amplification, the c-myc gene gains one to two additional copies, c- 
myc gene amplification was significantly associated with expres- 
sion of its mRNA (both by intensity in invasive cells and by 
percentage positivity in invasive cells), and with expression of its 
protein (by percentage positivity in invasive cells). However, our 
data were also consistent with the prior literature on c-Myc 
(reviewed in Nass and Dickson, 1997; Liao and Dickson, 2000), 
indicating complex transcriptional, post transcriptional, transla- 
tion^ and post-translational control of c-Myc expression in vitro. 
Specifically, in Table 5 we observed that in 40% of the high-grade 
tumours tested, c-Myc protein was expressed at high levels, despite 
a lack of its gene amplification. 

It will be interesting to analyse lower grade tumours and 
prernalignant lesions, with the same measurement tools, to 
determine if this c-myc amplification pattern is different, 
comparing different steps in onset and progression of the 
disease. Specifically, prior studies in fibroblasts and in human 
mammary epithelial cells (Liao et al, 1998, 2000a, b) have 
demonstrated that only a subtle deregulation of expression of c- 
Myc is sufficient to allow genomic instability. These prior cell 
biologic findings raise the question of whether c-Myc protein 
expression precedes or follows its gene amplification during the 
course of the natural history of breast cancer. It will also be 
interesting for future studies of lower grade breast cancers and 
prernalignant lesions to determine whether there is evidence of 
nuclear exclusion of c-Myc protein. Indeed, nuclear exclusion 
of c-Myc in high-grade tumours could serve to attenuate its 
functions in later stages of disease progression (Liao and Dickson, 
2000). 
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DECLARATI ON OF VICTORIA SMITH. Ph.D.. UNDER 37 CFR S1.132 

Commissioner for Patents 

PO. Box 1450 

Alexandria, VA 22313-1450 

Dear Sir: 

I, Dr. Victoria Smith, declare and state as follows: 



1. I am a Senior Scientist in the Department of Molecular Biology of Genentech, 
Inc., 1 DNA Way, South San Francisco, CA 94080. 

2. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

3. I joined Genentech in 1996. For approximately three years, Idirected a laboratory 
in the Department of Molecular Biology. During this time I was involved in target discovery for 
the Tumor Antigen Project, using DNA microarrays to discover genes differentially expressed in 
tumors compared to their expression in normal tissues. In connection with the above-identified 
patent application, I directed the generation and analysis of the microarray data attached as 
Exhibit B. 

4. Exhibit B reports the results of the microarray analysis conducted on the gene 
encoding PRO1800 (DNA35672) as part of the investigation of several newly discovered DNA 
sequences. The column "Unq Id" identify the gene as 851, which is DNA35672, while the 
column "DNA Id" identifies the particular lot of PCR product used. The microarray experiments 
were performed using well-established and accepted microarray techniques known in the art. 
{See, e.g., Nature Revs. Genetics, 5:229-237 (2004), attached as Exhibit C). The DNA samples 
used in the microarray studies were obtained from individual lung tumor tissue samples or 
individual normal lung tissue samples. The individual tumor and normal lung samples were each 
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compared to pooled samples of normal epithelial tissue. The level of expression in the lung 
tumor or normal lung tissue sample was compared to the normal pooled epithelial sample, and 
reported as a raw ratio. The average of the normal lung samples was then used to normalize the 
data to generate a ratio of expression of the PRO 1800 gene in lung tumor samples compared to 
the average expression in normal lung tissue. In the results reported in Exhibit B, a ratio of 2.0 
or greater is a significant result, and indicates a significant increase in expression of the 
PRO1800 gene in lung tumor tissue compared to the normal lung tissue controls. 

5. The results of the microarray studies reported in Exhibit B indicate that the gene 
encoding PRO1800 (DNA35672) is significandy overexpressed in nine of the eighty lung tumor 
samples tested compared to the normal lung tissue controls. That is the equivalent of one in 
every nine samples. In contrast, none of the individual normal lung tissue samples show 
significant overexpression of the PRO1800 gene. In addition, the average ratio of the lung tumor 
samples is significantly different from the average ratio of the individual normal lung tumor 
samples (p < 0.01). 

6. It is well-established in the art that overexpression of the mRNA for a gene is 
likely to lead to overexpression of the corresponding protein. Support for this statement can be 
found, for example, in the Molecular Biology of the Cell, a leading textbook in the field. (Bruce 
Alberts, et al, Molecular Biology of the Cell (4* ed. 2002), excerpts submitted herewith as 
Exhibit D). Figure 6-3 on page 302 illustrates the basic principle that there is a correlation 
between increased gene expression and increased protein expression. The accompanying text 
states that "a cell can change (or regulate) the expression of each of its genes according to the 
needs of the moment - most obviously by controlling the production of its mRNA." Molecular 
Biology of the Cell at 302, emphasis added. Similarly, figure 6-90 on page 364 illustrates the 
path from gene to protein. The accompanying text states that while potentially each step can be 
regulated by the cell, "the initiation of transcription is the most common point for a cell to 
regulate the expression of each of its genes." Molecular Biology of the Cell at 364. This point is 
repeated on page 379, where the authors state that of all the possible points for regulating protein 
expression, "[f]or most genes transcriptional controls are paramount." Molecular Biology of the 
Cell at 379. 

7. While not every lung tumor sample tested shows overexpression of the PRO1800 
gene, the data in Exhibit B indicate that a significant portion of lung tumors do (one in every 
nine), while none of the normal lung tissue samples show overexpression. Given the known 
correlation between overexpression of a gene and the corresponding overexpression of the 
encoded protein, it is very likely that a similar number of lung tumors will overexpress the 
PRO 1800 protein, while normal lung tissue samples will not. Together with the data reported in 
Example 16 that the gene encoding PRO 1800 is amplified in some lung tumors, the results 
reported in Exhibit B indicate that the PRO1800 gene and protein, as well as antibodies to the 
encoded protein, can be used to differentiate some cancerous lung tissue from normal lung tissue. 
Because not all lung tumors show overexpression. of PRO1800, it cannot be used to exclude a 
sample being tested as non-cancerous. However, the PRO1800 gene, protein, and corresponding 
antibodies are useful as a diagnostic tool since a significant portion of lung tumors overexpress 
the gene and most likely the encoded protein, while no normal lung samples do. 
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8. I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information or belief are believed to be true, and further that 
these statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 



By: 




Date: 



Victoria Smith, Ph.D. 



Of 1^0 £ 
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EXHIBIT A 



VICTORIA SMITH 

Genentech Inc. 
Dept Molecular Biology 
lDNAWay 
South San Francisco CA 94080 
Ph: (650) 225 7382 
Fax: (650) 225 6497 
Email: victoria@p ene.cnm 



Education 



PhD. (1991) Molecular Biology, University of Qunbridge, Cambridge, United 
Kingdom. 

Honors (1987) Biochemistry, University of Western Australia, Australia. 
Bachelor of Science (1986) Physical and Inorganic Chemistry, and Biochemistry, 
University of Western Australia, Australia. 



Work and Research experience 

Senior Scientist, Genentech Inc (August 1996 - present, promoted to Senior Scientist 
March 2001) 

Lab head, Dept Molecular Biology 

- Identification of potential therapeutic targets for cancer using novel microarray 
technology 

- Discovery and identification of novel secreted proteins 

- Development of cancer therapuetics 

Stanford University, California, U.S.A, (February 1995 - August 1996) 
Research Fellow, Department of Biochemistry . 

Research Genomic functional analysis of chromosome V of Saccharomyces cereoisiae 

Stanford University, California, U.S.A. (January 1992 - January 1995) 
Postdoctoral Fellow, Department of Genetics. 
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Research: Development of new methodology for whole genome functional analysis In 
microorganisms using genomic sequence data and inseitional mutagenesis 
Awards 

Human Frontiers Science Program Organization Long Term Fellowship (accepted, 
4/01/93-1/31/95). " * 

American Cancer Society (California Division) Fellowship (1993, declined). 

Cambridge University, United Kingdom {October 1988 - December 1991) 
Research undertaken at the Medical Research Council Laboratory of Molecular Biology, 
Cambridge, UK, for the degree of Doctor of Philosophy, Cambridge University. 
Thesis: A Molecular Genetic Analysis of Yeast Chromosome DC. 
Thesis Advisor: Dr. Barclay Barrel! 

Awards 

Max Perutz Prize in 1991 for outstanding performance by a graduate student. Awarded 
for advances in genomic-scale DNA sequencing methodology, and genetic analysis of 
the SNP1 gene of Saccharomyces cerevisiae . 

King Edward Memorial Hospital for Women, Western Australia (1988) 
Research Scientist 

Research: Analysis of hormone-inducibie mRNAs in breast tumors. 
University of Western Australia (1984- 1987) 

1984 - 1986: Bachelor of Science degree with double major in Biochemistry and Physical 
and Inorganic Chemistry. 

1987: First Class Honors in Biochemistry. Thesis: Nuclease Sensitivity and Methylation 
Patterns of the Phenylalanine Hydroxylase Gene. 

Awards 

Association of Commonwealth Universities Scholarship for study in the United 
Kingdom (accepted, October 1988 - October 1991, University of Cambridge). 
Hackett Scholarship for overseas study (1988, declined). 

Lady James Prize in Natural Science in 1986: Best student completing Bachelor of 
Science degree with a major in a natural science. 

JWH Lugg Prize in Biochemistry in 1986: Best student completing major in 
Biochemistry. 
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Convocation Prize in Science in 1985: Best student completing major in second year 
rnysics, Geology or Chemistry. 

Cht^ * 01611118(17 inl985:Best *»*«* feting major in second year 



Publications 

• T Jf & . 1CI - LD ' Peranan ' Expression Analysis of the 

Metaplaaa-Dysplasia-Carcinoma Sequence in Barrett's Esophagus (submitted) 

Tice DA. Szeto W. Soloviev I. Rubinfeld B. FongSE. Dugger DL. Winer J. Williams PM 
WW D. Smithy Schwall RH. PennkaD. Polakis P Journal o/Biobgical Chemistry ' 
277<16):1432935,2002Aprl9. gwwmstry. 

NJ. Maughan, F. Lewis, V.Smith (2001) Journal of Pathology 195, 3-6. 

F. Lewis, NJ. Maughan, V^mith, K. Hillan, P. Quirke (2001) Journal of Pathology 195, 66- 

D.LGarfinkel, MJ.Curcio and V^Smigi (1998) Ty Mutagenesis Methods in Microbiology, 
volume 26, 101-117. 69 

C Churcher, S. Bowman, K. Badcock, A. Bankier, D. Brown, T. Oullingworth, R. 

Connor,^DevIm / S.Genties / N.Hamlin,D.Harrte / T.Ho^ 

Jones, G. Lye, S. Moule, C OdeM, D. Pearson, M. Rajandream, P. Rice, N. Rowley, J. 

Skelton, V. Smith, & Walsh, S. Whitehead & B. BarrelL (1997) Nature, 387, 84-87. 

F. S. Dietrich, J. Mulligan, K Hennessy, M. A. Yelton, B. Allen, R. Araujo, E Aviles, A 
Berno, T. Brennan, J. Carpenter, B. Chen, J. M. Cherry, B. Chung, M. Duncan, E 
Guzman, G. Harteell, S. Hunicke^Smith, R W. Hyman, A. Kayser, C Komp, D. 

C^f^J'^ * MOSSdale ' K Nakaha ^ * R- Norgren, P. Oefner, 
Z ~ FXPetel D - Roberls ' P ^' &W 'S.Schrainm,T.Shc^en,y A Smia 1 ,P. Taylors. 
Wei, D. Botstein & R. W. Davis. (1997) Nafcre 387, 78-81. 
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V - Smith ' K D. Lashkari, D. Botstein, and P. O. Brown. (1996). Functional 
Analysis of the Genes of Yeast Chromosome V by Genetic Footprmting. Science 274, 
2069-74 

^mith, D. Botstein, and P. O. Brown (1995) Genetic Footprinting: A genomic strategy 
for determining a gene's function given its sequence. Proc. Natl Acad, Sci. USA 92 
6479-6483. " ' 

V ' Smifl ^ M - Craxton, A. T. Bankier, C. M. Brown, W. D. Rawlinson, M. Chee, and B. G. 
Barrett (1995) Microliter methods for the preparation and fluorescent sequencing of M13 
clones. Recombinant DNA Methodology II: a volume in the Selected Methods in Enzyme-low 
Series, pp. 607 - 621. 



V. Smith, M. Craxton, A. T. Bankier, C. M. Brown, W. D. Rawlinson, M. S. Chee, and B. 
G. Barrett (1993) Microliter methods for the preparation and fluorescent sequencing of 
M13 clones. Methods in Enzymology, 218 , 173-187. 

V ' Smith ^ M - s - ^ee (1991) A simple methodfor sequencing the complementary 
strand of ssDNA from M13 clones. Nucleic Acids Research 19, 6957. 

V Smith and B - G. Barret (1991) Cloning of a Yeast Ul snRNP 70K protein homologue: 
functional conservation of an RNA binding domain between humans and yeast EMBO 
JoumallQ, 2627-2643. 



W. D. Rawlinson, M. S. Chee, V. Smith, and B. G. Barrett (1991) Preparation of large 
numbers of single stranded DNA templates by rescue from phagemids in microliter 
plates. Nucleic Acids Research 19, 4779. 

y- Smifl V C M. Brown, A T. Bankier, and B. G. Barrett (1990) Semi-automated 
preparation of DNA templates for large scale sequencing projects. DNA Sequence 1, 73- 



S. J. Wysocki, E Hahnel, S. P. WiUdnson, V. Smith, and R. Hahnel (1990) Hormone- 
sensitive gene expression in breast tumors. Anticancer Research 10, 185-188 



S. J. Wysocki, B. Hahnel, A. Masters, V. Smith. A. J. McCartney and R. Hahnel (1990) 
Detection of pS2 messenger RNA in gynecological cancers. Cancer Research 50, 1800- 



PATENT8 

Genetic Pootprinting: Insertional Mutagenesis and Genetic Selection. U.S. Patent No. 
5,612,180. Inventors: Patrick Brown and Victoria Smith 

PATENTS PILED ( at Genentech Inc.) 

Methods of Detecting and Quantifying Gene Expression. Inventors: Victoria Smith, 
Edward Robbie, David Lowe, James Marsters. 

Compositions and Methods for the Treatment of Cancer. Inventors: Victoria Smith, 
Austin Gumey, Audrey Goddard, Fred DeSauvage 

Diagnostic for Dysplasia in Barrett's Esophagus. Inventor: Victoria Smith 

Numerous composition of matter filings related to novel gene discovery, pending 



EXHIBIT B 



Unq Id DNA Id 



Lung Tumor Samples 



Experiment Name 



Raw Ratio Normalized Ratio 
(sample/pooled (sample/normal 
epithelial) lung) 
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851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 

851. 
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1.489 
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0.337 
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0.367 
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1.509 


1 ^7 7*5R ll inn +i imnr 4 OTA . m _ _ • 

10/ ,/ 00. iung xumOr lo/O VS epi pool 
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ioo,y^u, iung tumor ib4/ 
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0.886 
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135,920. lung tumor hf-1 71 9 
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1.076 


157,736. lung tumor hf-1 775 


0.422 


0.742 


157,736. lung tumor hf-1785 


0.429 


0.754 


135,920. Lung Tumor HF1602 


0.346 


0.608 


135,920. Lung Tumor HF1 651 


0.592 


1.041 


135,920. Lung Tumor HF1729 


0.611 % 


1.074 


101,241. Lung Tumor HF631 


0.627 


1.102 


101,241. Lung tumor hf840 


0.579 


1.018 


157,736. lung tumor R1057 vs epi pool 


0.523 


0.920 


157,736. lung tumor R1094 vs epi pool 


1.004 


1.765 


157,736. lung tumor R1094 vs epi pool 


0.813 


1.430 


157,736. lung tumor R1372 vs epi pool 


0.808 


1.421 


157,736. lung tumor R1372 vs epi pool 


0.739 


1.299 


157,736. lung tumor R417 vs epi pool 


0.431 


0.758 



851 . 1 57,736. lung tumor R542 vs epi pool 0.454 

851. 157,736. lung tumor R543 vs epi pool . 0.651 

851 . 1 57,736. lung tumor R544 vs epi pool 1 .407 

85 1 . 1 57,736. lung tumor R544 vs epi pool 0.655 

851 . 1 57,736. lung tumor R685 vs epi pool 0.894 

851. 157,736. lung tumor R693 vs epi pool 0.836 

851 . 1 57,736. lung tumor R693 vs epi pool 0.527 

851 . 1 57,736. lung tumor R737 vs epi pool 0.479 

851 . 1 57,736. lung tumor R742 vs epi pool 0.526 

851 . 1 57,736. lung tumor R777 vs epi pool 1 .041 

851 . 157,736. lung tumor R777 vs epi pool 0.573 

851 . 1 57,736. lung tumor R789 vs epi pool 0.849 

851 . 1 57,736. lung tumor R789 vs epi pool 0.707 

851. 157,736. lung tumor R791 vs epi pool 0.869 

851. 157,736. lung tumor R791 vs epi pool 0.71 1 

851. 135*920. Lung Tumor RNA689 0.209 

851. 135,920. Lung Tumor RNA691 0.536 

.851. 135,920. Lung Tumor RNA693 0.441 

851. 157,736. lung tumor vs epi pool 16:613 

851. 157,736. lung tumor vs epi pool 0.443 

851'. 157,736. lung tumor 15.387 

851. 157,736. lung tumor/Ref.RNA 0.761 

851. 157,736. lung tumor1582/Ref.RNA 0.92 

851. 157,736. lung tumorl 683/epipool 1.35 

851. 157,736. lung tumorl 685/epipool 0.707 

851. 157,736. lung tumorl 853/RERRNA 1 ^491 

851 . 1 01 ,241 . lung tumorhf 641 0.91 1 

851 . 1 57,736. LungBaCa vs Epi 0.342 

851. 157,736. LungBAca1662 vs Epi 3*362 

851. 157,736. LungBAca 2.466 

851. 157,736. LungCaSq-hf1602 0^713 

851. 157,736. LungCaSq-hf1647 0.885 

851 . 1 57,736. LungSqCa-hf 1293 1 .291 

85 1 . 1 57,736. LungSqCa-hf 1 646 0.41 1 

851. 157,736. LungSqCa-hf? 1.112 



Normal Lung Samples 



851 . 1 57,736. normal lung 795 0.221 

851. 157,736. normal lung hf-1 773 0.516 

851. 157,736. N. lung R1415 vs univ. ret 0.991 

851. 157,736. N. lung R1431 vs univ.ref 0.709 

851 . 1 57,736. N. lung R41 7 vs univ.ref 0.669 

851 . 1 57,736. lung Normal 1 052 vs epi pool 0.603 

851. 157,736. lung Normal 1417 vs epi pool 0.317 

851 . 1 57,736. lung Normal R41 7 vs epi pool 0.982 

851 . 1 57,736. lung Normal R41 9 vs epi pool 0.335 

851 . 1 57,736. 423Lunglnf!amTA1 0.517 

851 . 1 57,736. 488-551 LunglnflamTAI 0.486 

851 . 1 57,736. 488-552lnf ImdLungTAI 0.425 

851. 157,736. 1054LunglnflamTA1 0.56 



0.798 
1.145 
2.474 
1.152 
1.572 
1.470 
0.927 
0.842 
0.925 
1.830 
1.008 
1.493 
1.243 
1.528 
1.250 
0.367 
0.942 
0.775 

29.211 
0.779 

27.055 
1.338 
1.618 
2.374 
1.243 
2.622 
1.602 
0.601 
5.911 
4.336 
1.254 
1.556 
2.270 
0.723 
1.955 



0.389 
0.907 
1.742 
1.247 
1.176 
1.060 
0.557 
1.727 
0.589 
0.909 
0.855 
0.747 
0.985 



157,736. 1415Lunglnf!amTA1 0.706 
157,736. 1415LunglnfiamTA1 0.494 
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Expression profiling — best practices 
for data generation and interpretation 
in clinical trials 

The Tumor Analysis Best Practices Working Group* 



Microarrays am routinely used to assess 
mRNA transcript levels on a genome-wide 
scale. As use and acceptance Increases, 
there is intensified focus on appropriate 
methods of data generation and 
interpretation, with Important questions 
being asked about the best data analysis 
methods. The development of such 'best 
practices' is needed, as microarrays — In 
particular, Afrymetrix oEgonudeotlde arrays 
— are becoming increasingly important 
In human cSnical trials, both for 
differentia] diagnosis and monitoring 
of pharmacological efficacy. Here, 
representatives from high-volume 
microarray core centres consider the 
current status of 'best practices', 
focusing on the broadly used Asymetrix 
oligonucleotide arrays. 

Microarrays represent a major technological 
advance in molecular biology. The introduc- 
tion of any such advance is typically followed 
by a period of optimization and standardiza- 
tion. The latter is a crudal part of any maturing 
technology, as it allows an approach in which 
advances are made in parallel by individual 
researchers and companies who contribute 
new knowledge based on the existing stan- 
dard. Any such standards must be constantly 
reassessed; stale or stagnant standards can 
inhibit the development of the technology. 

Microarray-based mRNA-expression pro- 
filing can be considered to be the first mature 
genome-wide analysis technology, reflected in 
an increased interest in using microarrays as 
an endpoint in dinical trials. However, regula- 
tions of clinical trials require the development 
of clear standards for use and interpretation of 
microarray data (commonly referred to as 
quality control and standard operating proce- 
dures (QC/SOPs) anoVor 'best practices'). 
Guidelines for reporting and annotation of 
microarray data from the Microarray Gene 
Expression Data (MGED) Sodcty (see online 
links box) — using MIAME (Minimum 
Information About A Microarray Experiment) 
standards (BOX l) and the MAGE-ML mark-up 



language 14 — represent an important step 
towards this goal. The efforts of this multina- 
tional academic-industry partnership has 
made it possible to develop databases that 
can house the many types of microarray data 
(see below) within the same data struc- 
ture, enabling some data queries between 
experiments and experimental platforms. The 
ArrayExpress microarray database 3 (see online 
links box) is the first major publidy accessible 
database that adheres to this universal data- 
presentation platform, and some prominent 
journals (such as Nnrure, Cell EMBO Journal 
and The Lancet) now demand that published 
microarray data conform to the MIAME 
standards. In addition, microarray manufac- 
turers, such as Aflymetrix, have implemented 
M LAME- compliant data output in their new 
software releases. 

The MGED Sodety has effectively devel- 
oped data-reporting guidelines* but it has 
not addressed issues of data generation and 
interpretation. The latter are more intimatdy 
coupled to the specific experimental plat- 
form. Of the three commonly used types of 
microarrays (spotted cDNA, spotted oligonu- 
deotide and Afrymetrix arrays), each has dis- 
tinct methodologies assodated with them; 
accordingly, the issues of data interpretation 
are also different (BOX 2). These differences 
make it difficult or impossible to develop 
cross-platform guidelines for data generation 
and interpretation. Best practices for spot- 
ted cDNA arrays are especially problematic 
because the manufacture of the arrays varies 
considerably from place to place. In addition, 
all spotted arrays use co-hybridization of a 
test RNA sample labelled with one colour 
KUJORomoRE with a control RNA labelled with a 
different colour to which the test is compared 
on the same spot. The output is in the form of 
a ratio of hybridization signals that is compa- 
rable to other experiments only if the same 
control RNA is always used. Therefore, the 
development of standards in spotted arrays 
would require all laboratories to use the same 
control RNA solution before data could be 
easily compared 



Manufactured oligonucleotide arrays 
(both mechanically spotted and synthesized 
in situ) have the advantages of being centrally 
produced under controlled conditions. 
Afrymetrix photo urHOGRAPHy-produced arrays 
have been available for nearly 10 years, 
whereas mechanically spotted oligonucleotide 
arrays have only very recently begun to 
appear in the marketplace. For example, 
Agilent Technologies (see online links box) 
recently released 17,000 60-mer oligonu- 
cleotides printed five times each on glass 
slides (85,000 features). Spotted oligonu- 
deotide arrays typically have a single spot per 
gene (single probe measurement), whereas 
Aflymetrix arrays provide multiple measure- 
ments — a series of independent or semi- 
independent oligonucleotides query each 
RNA in solution (the probe set) (box 2). 
Afrymetrix probe sets arc constructed from a 
series of perfect-match and paired-mismatch 
oligonucleotides, allowing some assessment 
of non-spedfic binding and performance of 
the probes. Overall, the Afrymetrix probe sets 
provide a variety of measurements that allow 
robust measures of gene expression. The use 
of multiple perfect-match and mismatch 
probes for each gene enables the development 
of different methods of interpreting the 
hybridization patterns across the probe set 
and calculating a single 'expression level' or 
'signal* that reflect the gene's relative expres- 
sion level. A number of probe -set interpreta- 
tion algorithms for Afrymetrix arrays are 
available (see bdow for discussion). 



"[distinct methodologies] 
make it difficult or 
impossible to develop 
cross-platform guidelines 
for data generation and 
interpretation" 



The increasing use of Afrymetrix micro- 
arrays, and the emergence of this technology 
as an endpoint in clinical trials, has led to 
requests to devdop, in both the pharmaceuti- 
cal and academic research communities, best 
practices in data generation and analysis. 
Given the many differences between spotted 
cDNA, spotted oligonudcotidc and Afry- 
metrix arrays, the best practices need to be 
developed separately for each experimental 
platform; this is in contrast to data reporting 
that can be standardized across all platforms 
(Boxes i and 2). The Tumor Analysis Best 
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Box 2 1 Microarray experimental platforms 

There are three different types of microarray in common use: spotted cDNAs, spotted 
oligonucleotides and Asymetrix arrays. 

Sported cDNA arrays 

Spotted cDNA arrays typically use sets of plasmids of specific cDNAs in gridded liquid aliquots. 
The inserts of each clone are typically amplified by PCR, and a few picolitres are physically 
spotted onto glass slides by liquid-handling robots. Robotic spotters can spot 100,000 spots per 
slide* and duplicate sets of clones are often spotted. The advantages of spotted cDNA arrays are 
that the content of each microarray is determined by the researches with complete flexibility in 
number and type of cDNA clones spotted. Also, the cost per array is relatively low, as the done 
sets are a PCR- renewable resource and the glass slides are themselves inexpensive. The amount of 
the RNA that corresponds to each spot is determined relative to a second con Uol RNA solution 
that is hybridized to the same spot, and a ratio is obtained. 

Disadvantages of spotted cDNA arrays include the variable amount of DNA spotted in each 
spot, the 1 0-20% 'drop out* rate of failed PCR reactions or failed spots and mis-identification of 
clones (that is, the spot Is not what you think it is). Also, there is no control over the actual 
sequence of the clone. As many gene-coding sequences contain regions of sequence that are 
shared with other genes, there are questions of specificity of the hybridization to the relatively 
large cDNA inserts. Spotted cDNA arrays were embraced by most academic centres, owing to 
their flexibility and relatively low cost 

Spotted oligonucleotide arrays 

These arrays are also built by liquid handling on glass slides; however, the input solution is a 
synthetic oligonucleotide (often 60-70-mers).The resulting spotted material is typically of 
known concentration, of known sequence and is single stranded (all advantages relative to 
spotted cDNAs). Most of the process can be automated, leading to less sample mix-up and less 
drop-out of samples. 

Disadvantages of spotted oligonucleotides include the relatively high cost of synthesizing large 
numbers of large oligonucleotides and the non-renewable nature of the resource. Spotted 
oligonucleotide arrays are becoming increasingly available. 

Asymetrix GeneChips 

These micro arrays are factory designed and synthesized. Design is done using software to 
choose a series of 1 1 25-mer probes from the 3* end of each transcript or predicted transcript 
in the genome; each of the 1 1 probes is then paired with a similar mismatch probe that is designed 
to contain a mutation in the centre The latter serves as a form of control for hybridization 
specificity. Synthesis of arrays is done using light-activated chemistry and photolithography 
methods, and feature size can be reduced to approximately 8 um J , with about 1 million probes in a 
1 .2 cm 2 glass area. Probe-set algorithms interpret the signals from each 22-oligonucIcotide probe 
set, and derive a single value (signal) from the patterns of hybridization to the 22 individual probes. 
This signal is then normalized to the entire microarray, or to the probe sets across an entire project 

For a more general discussion of normalization and analysis methods of different microarray 
platforms, the reader is referred to the excellent web information resource of the MGED group 
(see The MGED Data Transformation and Normalization Working Group in online links box). 



Practices Working Group (see box 3) was 
convened to discuss and develop best prac- 
tices for Aflyrnetrix microarrays, including 
QC and SOPs for both data generation and 
data analyses. The first meeting was held in 
Santa Clara in March 2003, followed by a 
series of conference calls that focused on dis- 
cussions of data generation and analysis stan- 
dards for the Affymetrix oligonucleotide 
arrays. The Working Group deliberately 
focused on a platform that has widespread 
usage and is most likely to be used in clinical 
trials owing to the previously standardized 
manufacturing process. Here, we discuss rec- 
ommendations for experimental design, 
probe-set analysis algorithms, signal/noise 
assessments and biostatistical methods. 

Experimental design 

Appropriate experimental design is a key 
aspect of all science, and microarray studies 
arc no exception. The relatively high cost of 
some commercial microarray platforms is a 
frequently cited reason for suboptimal experi- 
mental design, especially with regards to the 
number of replicates. Data interpretation is 
inevitably compromised when replicates are 
decreased. 

Replication in cross-sectional studies. The 
appropriate number of microarray replicates 
for any particular condition or time point 
depends on the source of biological variability 
in the study samples. Inter-individual variabil- 
ity is very large in outbrcd (genetically hetero- 
geneous) humans, but is very small within 
inbred mouse strains. For example, expres- 
sion profiles derived from muscle from differ- 
ent mice are not more variable than from 
muscles isolated from one mouse 4 . Defining 
the confounding variables that contribute to 
experimental variability, such as intra-subject, 
inter-subject, inter-group and technical vari- 
ation (microarray protocol), is needed to 
design and statistically power a study, and 
to detennine the number of replicates that are 
needed. In general, inbred mice require test- 
ing only three or four mice per group. We and 
others have found that five or six out-bred 
rats per group provide statistically robust 
results'**. By contrast, human samples require 
considerably more individuals per group. Key 
variables in human samples include tissue 
heterogeneity, stage of disease and inter- 
individual variation, ail of which have been 
found to be major confounding variables 7 . 

Replication in longitudinal studies. It has long 
been recognized that, in human clinical trials, 
longitudinal designs provide considerably 
greater power at lower numbers of replicates. 
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Box 3 1 The Tumor Analysis Best Practices Working Group* 

The Tumor Analysis Bcsl Practices Working Group is a group of investigators who study the best 
practices of tumour analysis in humans taking part in clinical trials. The following authors are 
members of the Group: 

• Eric P. Hoffman is at the Research Center for Genetic Medicine, Children's National Medical 
Center* Washington DC 20010, USA. email: ehoffinon@cnmcresearch.org 

• Tarif Awad, John Palma, Teresa Webster, Earl Hubbeli and Janet A Warrington are at Affymetrix, 
Santa Gaxa, California 9505 1, USA. emails: tarif_awad@afrymeuix.com; 
johnjamia@affymetrix.com; teresa_webster@aflymetrix.com; earl_hubbeU@airymetrix.com; 
janet_wanmgton@afrymetiix.com 

• Avrum Spirais at The Pulmonary Center, Boston University Medical Center and the 
Bioinfbrmatics Program, Boston University, Boston, Massachusetts 021 18, USA. e-mail: 
aspira@lung.bumcbu.edu 

« George Wright is at the Biometric Research Branch, Division of Cancer Treatment and 
Diagnosis, National Cancer Institute, National Institute of Health, Bethesda, Maryland 20892, 
USA. e-mail: wrightge@mailjiih.gov 

• Jonathan Buckley and Tim Triche are at the Children's Hospital, University of California, Los 
Angeles, California 90O89, USA. e-mail buckley@hscuscedu; tricht@hsc.usc.edu 

• Ron Davis, Robert Tibshirani and Wen_hong Xiao are at Stanford University, Palo Alto, 
California 94303, USA. e-mails: dbowe@stanford.edu; tibs@statstanford.edu; 
wzxiao@pmgm2_tanford.edu 

■ Wendell Jones is at Expression Analysis Inc., Durham, North Carolina 27713, USA. e-mail; 
wjoncs@expressionanalysis.com 

• Ron Tompkins is at Harvard University, Boston, Massachusetts 021 15, USA. e-mail: 
rtompkins@par_iers.org 

• Mike West is at the Institute of Statistics and Decision Sciences, Duke University, Durham, 
North Carolina 27708, USA. e-mail: mw@statduke.edu 



They best control for inter-individual varia- 
tion because each subject serves as their own 
control. Serial blood sampling from single 
subjects is the least invasive 1 {see below for 
further discussion), and, for example, cancer 
patients are often longitudinally sampled 9 . 
Serial biopsies of other tissues are more inva- 
sive: however, a number of serial human mus- 
cle biopsy studies of healthy volunteers after 
different types of exercise training have begun 
to appear in the literature^ 11 . 

Expression profiling of blood samples 
(longitudinal or cross-sectional design) is the 
protocol that is most likely to be used in 
human clinical trials. One of the Working 
Group's goals was to establish SOPs for blood 
sample collection and RNA isolation in clini- 
cal trials. A specific follow-up report of these 
recommendations will be published else- 
where. Such a protocol must be easily adapt- 
able to multiple trial sites, with relatively little 
need for resident expertise to carry out the 
isolation protocol. So far, standard methods for 
isolating peripheral-blood mononuclear cells 
have shown the most reproducibility, although 
others are being tested (see Aflymetrix Tech- 
nical Note in online links box). Cells isolated 
soon after collection can be flash frozen for 
storage and subsequent RNA isolation or an 
RNA stabilizing compound can be added if 
the samples need to be transported. 

Tissue/cell heterogeneity. Tissue heterogeneity 
is a major coiifounding variable in most micro- 
array experiments, In inbred mice, tissue het- 
erogeneity is typically normalized by using 
whole organs. This is rarely possible in human 
experiments, and particularly not in clinical 
trials; the limited amount of human tissue 
that is available exacerbates heterogeneity. The 
mixed cell populations of peripheral blood 
can be thought of as a tissue heterogeneity 
problem similar to that encountered in all 
solid tissue and tumour biopsies. Indeed, a 
recent study showed that variation as a result 
of tissue variability in human muscle biopsies 
often exceeded inter-individual variability 12 . 
One potential solution to the tissue hetero- 
geneity problem lies in bioinformaric meth- 
ods. If computer software can be trained to 
recognize the expression profiles of each indi- 
vidual cell type within a mixed tissue sample, 
then it should be possible to subtract them 
from each other and to renormalize to obtain 
a set of cell-specific expression profiles derived 
from a single mixed profile. This will be most 
easily done on tumour biopsies, in which the 
main cells of interest are tumour versus conta- 
minating normal tissue. Although there are no 
published examples so far, such methods are 
maturing rapidly. 



An experimental alternative to mitigate 
confounding tissue heterogeneity is to isolate 
pure cell populations for expression profiling. 
Many such methods are well developed in the 
research laboratory, including i_uoju_c_kce- 

ACTtVATED CELL SORTING (FACS)'\ NEGATIVE CELL 

isolations from blood (for example; Stem Cell 
Technologies RosetteSep) 14 and laser capture 
microdissection 15 . To research scientists, the 
profiles that are derived from isolated cell types 
are a more intuitive approach to define biologi- 
cally relevant pathways. However, it should be 
noted that uses of array-based analysis of gene 
expression approved by the US Food and Drug 
Adrnintstratjon (FDA) will probably focus on 
reproducibility and robustness (as well as on 
predictive accuracy), rather than on biological 



"If computer software can 
be trained to recognize the 
expression profiles of each 
individual cell type within a 
mixed tissue sample, then it 
should be possible to subtract 
them from each other. . 



interpretation or justification. The high-tech 
methods used to isolate specific ceO types from 
dinical samples are unlikely to make their way 
into clinical trials unless tissues are procured in 
a highly centralized way. 

Procedural variation. Beyond the usual issues 
of sampling and accrual, gene- expression data 
will be subject to many additional sources of 
error. For example, the surgical removal and 
processing of tumour tissue can vary consid- 
erably from site to site. Laboratory QC proce- 
dures in tissue handling, RNA extraction and 
processing, and variations in protocols for 
data management and processing will need to 
be addressed in any clinical trial design. In 
particular, prolonged tissue ischaemla prior to 
prcKDessing of surgically RESEcrcn tissue can sig- 
nificantly alter gene expression 1 *. All tissue 
samples should be flash frozen within min- 
utes of surgery and stored at -80°C or below. 
Samples should also be kept in small, airtight 
containers and kept from drying out during 
frozen storage by placing fragments of ice in 
with the sample. 

Technical variability 

The standard laboratory protocol for generat- 
ing RNA profiles using Afrymetrix microarrays 
irwolves a series of steps (RG. I ). 
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Figure 1 1 Sample processing and micro array 
Interpretation of Affymetrix GeneChips. Ftash- 
trozen tissue (-50 mg) Is homogenized to isolate 
total RNA. Singe-stranded {ss) cONA and then 
double-stranded (ds) cONAis made from -5 ug 
of total RNA, Double-stranded cONA contains a 
77 RNA-polymerase promoter adjacent to the 
3* pofyAta3 of each transcript It is transcribed 
^^frDtoger^temcnithan400biotir^ed 
cRNA rnctecutes for each ds cONA nrtecute. "The 
biolinylated cRNA is fragmented and hybridized 
to the microarrays. Each transcript Is queried by 
one or more probe sets of 1 1 perfect-rriatchand 
1 1 paired -mismatch drgontideotides (the tatter 
contain a centrally located point mutation as a 
lorm of hybridization specfficity control). Currently 
evaHable Affymetrix microarrays have -54,000 
probe sets on each 1 .28 cm 2 glass rrdcroarray 
(-1 .2 mlon 25-mer otigonucleotides on the 
HG-U1 33Ptus 2.0 array). The biotinyiated 
rjRNAfragrnents hybridize to the appropriate 
ofcgonudeo&ia features. A laser scanner determines 
tna smount of bound biotinyialed cflNAindf/eclly 
through the strep tavicfin • conjugated phycoerythrin 
fuoroscGnco at oach feature wfrrin a pjot>e set. Tho 
component probe pairs are interpreted and 
averaged to arrive at a single signal that reflects the 
relative arjundance of the original mRNA Probe 
sets are interpreted by any one of a number of 
probe-set algorithms, each providing a signal that 
reflects the relative rrybrtrJcation Intensity across the- 
probe set RT. reverse transcriptase. 



RNA isolation. RNA quality and quantity is 
crucial to the success and reproducibility of 
the expression profiles. RNA quantity and 
quality is generally checked by complemen- 
tary methods: UV 260/280 ratio > 1 .8, agarose 
gel electrophoresis or an Agilent Bioanalyzer 
to visualize dear 18S and 28S ribosomal RNA 
bands. Total RNA (5-10 ug) is input into the 
cDNA/cRNA reaction, with an expected cor- 
rected yield of biotinyiated cRNA of between 
4- and 10-fold greater than the total RNA 
input (so 5 ug of total RNA must yield at least 
20 pg of biotinyiated cRNA, or the sample is 
discarded). Hie biotinyiated cRNA should be 
500-3,000 base pairs (bp) in size. After frag- 
mentation, the cRNA should be 50-200 bp. 
The Working Group recommends that sam- 
ples that do not meet these criteria should be 
discarded. 

If RNA amount is limiting — as is the case, 
for example, with laser capture microscopy 
samples, Row-sorted cell samples or small tis- 
sue samples — a two-round amplification 
protocol can be used. For example, 200 ng of 
total RNA is processed for in vitro transcrip- 
tion (IVT), with the same goal of 4-10-fold 
amplification (>800 ng of cRNA output). 
One hundred nanograms of this cRNA is 
then reverse transcribed into cDNA using 
random primers, after which a second IVT is 
done. The second round IVT should result in 
a 400-fold amplification. 

Micro array controls. Hybridization controls 
include visualization of the image so that any 
abnormalities in hybridization patterns can be 
detected. ProbeProffler from Corimbia Inc. is 
a program with extended capabilities for 
detecting defects in microarray manufacture. 
Asymetrix MAS 5.0 software adjusts the 
mi croarray-scann ed image to a common tar- 
get intensity by using a scaling factor. In addi- 
tion, a general index of chip background and 
noise is represented by the percentage of 'pre- 
sent calls* (probe sets for which the hybridiza- 
tion to the perfect-match probes is significantly 
higher than mismatch hybridization). The 
Working Group believes thatboth the scaling 
factor and the percentage of present calls arc 
important QC criteria. Considering MAS 5.0 
chip analyses, the scaling factors to normalize 
chips within a project should He within two 
standard deviations of the mean, with present 
calls being greater than 25% (box 4). The per- 
centage of present calls is often lower when 
B or C arrays that contain higher propor- 
tions of more poorly characterized transcript 
units (expressed sequence tags or computer- 
predicted open reading frames) are used. The 
percentage of present calls across a set of sam- 
ples should be consistent, within a range of 



1 0%. Some software packages allow the iden- 
tification of statistical 'outlier' microarrays in 
a group of microarrays in a given project, 
which additionally enables the experimenter 
to flag and exclude specific microarrays that 
are not acceptable for an analysis. In addition 
to these criteria, acceptable hybridizations 
must have adequately intact input RNA as 
shown by 3' to 5' ratios of hybridization within 
probe sets. A typical control is the gryceralde- 
hyde 3- phosphate dehydrogenase (GAPDH) 
gene, which should have 3' to 5' ratios of less 
than 3 (box 4). 

The QC criteria provided above are based 
on MAS 5.0 probe-set algorithms and data 
analyses. The measures of present calls and 
scaling factors are useful and serve as initial 
summary measures of the performance of a 
particular rmcroarray. However, more focused 
statistical methods, coupled with routine 
visual inspection of images, hold promise for 
the continuing improvement of data quality 
and screening abilities. 

Large-scale analyses of microarray data 
across laboratories have not yet been reported. 
However, the Working Group feels that adher- 
ence to the above QC criteria, using standard 
RNA isolation and processing methods, should 
yield data that are consistent between laborato- 
ries and intrinsically comparable. The same 
set of criteria can also be used as best practices 
for data generation in the design and conduct 
of clinical trials. 

Standard clinical laboratory practice is 
to develop programmes for submission of 
known samples to different laboratories 
and assessment of comparability of results. 
Such programmes are under development 
within larger collaborative efforts, such as 
the National Heart, Lung and Blood Institute 
(NHLBI) Programs in Genomic Applic- 
ations (see the HOPGENE Program for 
Genomic Applications in online links box) 
and the National Institute of General Medical 
Sciences (NIGMS) Glue Grant (see online 
links box). 

Data analysis and Interpretation 

Signal generation versus statistical analyses. 
Two relatively distinct steps underlie all 
data analyses of Afrymctrix oligonucleotide 
microarrays: the development of a normal- 
ized 'signal' for each transcript on each 
microarray and the subsequent statistical 
analysis of differences in signals between dif- 
ferent arrays. The first step involves probe- 
set algorithms that use all, or part, of the 
component signals within a probe set and 
then derive a single signal that is representa- 
tive of the relative abundance of each mRNA 
queried in each array. The second step is the 
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Box 4 ] Quality Control metrics 'tor: Affymetrix microarrays 

RNA quality -V. 

cDNA/cRNA efladcnqf 

M-fold amplification from total KNA ] 500-3,000 bp prior to fragmentation 1 50-200 bp after 
fragmentation 

Chip hybridization 

Image inspection for defects I Scaling factors within two standard deviations within a project I 
MAS 5.0 present calls >259fe;fbr the a-seres arrays, including the HG-U133Plus 16 array I 
Per^tegepn^ 

: ^atioi^':;V'^^; v •/^^:/' :: ^'' ; " : "- !' : - :\---.'. 

Project normalizatibn 

The detection of statistical outliers for chips, probe sets or individual probe pairs requires. . 
normalization and analysis across an entire project This is afforded by the dCHJP and 
ProbcProfiler, and other software packages. Data-analysis packages that rely on intra-chlp . 
normalization and scalmg typically do not enable detec statisticfll outliers. 

Chip outliers 

Probe-set outliers I Probe-pair outliers I Range in present calls < 10% 



application of bioin/ormatic and statistical 
methods to identify interesting subsets of 
the assembled data of all arrays within a 
project. There is considerable debate about 
the best methods for both of these steps (see 
below for a discussion). Although the two 
steps are separable, it is clear that they have a 
marked influence on each other. It is in this 
realm that the bioinformatics of microarrays 
becomes avant-garde, and with the ground* 
breaking nature of research comes consider- 
able debate as to what is appropriate in any 
specific situation. 

Before discussing the different methods for 
probe-set analysis and data interpretation, it is 
important to point out that much of the 
debate in the field of bioinformatics about 
microarray interpretation revolves around sig- 
nal/noise ratios. A common assumption is that 
signal/noise ratios across a microarray are 
homogeneous, or at least similar in magnitude, 
This might be true for general background 
hybridization, but not for the performance of 
probe sets, in any particular microarray, there 
are probe sets that give very strong and dear 
hybridization patterns and those that perform 
poorly Many of the best performing probe sets 
(those with a highly significant probe-set 
detection p value) reflect highly expressed tran- 
scripts with no closely related sequences that 
might aoss-hybridize. Low-level transcripts, or 
transcripts that belong to gene families with 
highly homologous sequences derived from 
distinct genes, often have corresponding probe 
sets that do not perform as well and might 
have a significant, if not overvvhelming, noise 
component The signal from such probe sets 
is difficult to interpret, and data interpreta- 
tion can be limited to only the best perform- 
ing probe sets, although arguably the most 
interesting data comes from the genes that 
are expressed at low levels but that still show 
significant differences between samples. 

Determining adequate sensitivity of the 
signals and signal/noise responses relative to 
the absolute quantity of mRNA in clinical 
samples is crucial as microarrays become a 
component of clinical trials and diagnostic 
models. Affymetrix arrays provide a concen- 
tration of each mRNA queried relative to the 
genome-wide mRNA profile of the sample; it 
is assumed that the global mRNA content of a 
tissue as a whole docs not change significantly, 
making relative mRNA quantification an 
accurate reflection of the response of the indi- 
vidual gene. This method differs from absolute 
quantification of specific mRNAs (such as 
si NuajusE protection and REALTIME pcr), or the 
isolated transcript ratio determined by co- 
hybridization of two samples to spotted 
cDNA or oligonucleotide arrays (BOX 2). 



Asymetrix arrays achieve considerable sensi- 
tivity through the inherent redundancy of 
the probe set; however, the Working Group 
acknowledged that some genes, such as some 
cytokines that are functional at very low 
expression levels, are probably below thelimit 
of detection. 

The Working Group agreed that each 
project will have its own signal/noise opti- 
mum, and analysis methods that prove best 
for one project might prove unsuitable for 
another. Ideally, a signal/noise ratio should 
be optimized for each project or trial using 
different probe-set algorithms and data- 
filtering methods, and some systematic 
efforts towards this end are beginning to 
appear in the literature 17 . 



"... adherence to the above 
QC criteria ... should yield 
data that are consistent 
between laboratories and 
intrinsically comparable " 



After a signal is derived for each probe set, 
data is interpreted using statistical and visual- 
ization methods. All statistical methods run 
into two generic problems when faced with 
microarray data that are inter-related. The 
first is the curse of dimensionality — each 
gene is potentially related to every other gene, 
so all permutations of all available data must 
be considered, leading to an exponentially 
increasing number of possible associations in 
multidimensional space. The problem arises 



when associations (samples) become lost as 
the dimensionality increases — associations 
lose their local value and become gcnericalJy 
global in statistical terms. Statistical models 
attempt to circumvent this curse by requesting 
larger and larger sample sizes, but fulfilling 
the requests becomes functionally impossi- 
ble for the experimentalist. There is no easy 
answer to these problems and they remain a 
challenge for future bioinformatics research 
that uses microarrays' 1 . 

Derivation of signal: probe-set algorithms and 
normalizations. One of the key advantages of 
the Affymetrix platform is the multiple mea- 
surements that are intrinsic to the probe set 
— most probes indude 1 1 perfect-match and 
II paired-mismatch 25-bp oligonucleo- 
tides per gene (FIG. I). Previous versions of 
GeneChip arrays used probe-set design meth- 
ods that led to considerable overlap between 
probes, so that hybridizations to each fea- 
ture/probe were not independent measure- 
ments; this led to considerable uncontrolled 
weighting of the contribution of any particu- 
lar region of sequence to the resulting signal 
All recent chips use a much more refined 
probe-set design with less overlap and consid- 
erably better performance of the probe set. 
Improvements in array and probe-set designs 
have been accompanied by an evolution in 
primary analysis algorithms and the support- 
ing software provided by Afrymetrixfor data 
analysis and interpretation 1 '. Afrymctrix 
default algorithms are based on well-docu- 
mented statistical methods, namely the robust 

TUKETS BI-WEIGHT ESTIMATOR and WlLCOXO*TS SIGNED 

rank, to calculate the final probe-set signals 
and associated p values, resrjectively 4 * 20 . 
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Table 1 1 Comparisons of probe-set analysis algorithms 



Asymetrix MAS 5.0 

dCHIP cfrfference 
-model 

dCHIP 

,rma:. :^ 

ProbeProfiler 



mismatch signal 

High 

High 

None 
Norte 
Moderate 



Normalization method 



Individual chips 
C^oss-project: 

Cross-project 
r^oss^projeci 
Extensive 



method-- Qj^jer detection Sensitivity* Specificity* 

Little 

^Mc^eVate i 



Moderate 
Moderate 
Extensive 



Good 
Good 

Excellent 
Exc^lerit 
Good 



Excellent 
Excellent 

Good 
Good 
Good 



is based P™ 3 ®* °n ROC (rocdvor operating characteristic} curves of spfce-in mRNA dala based on pubfished reports (see htWAvww.btocwx^or.ora) 21 ^ 
'Specificity measurements are based both on expectations from mismatch weights and pubfchod observations in experimental data sets' 7 -' 8 . 



Afrymetrix has announced plans to con- 
tinue to improve the software components of 
the GeneChip platform. The upcoming release 
of the GeneChip Operating System (GCOS) 
is expected to incorporate refinements in the 
user interface, data management and analysis 
algorithms. Software tools aside, the most sig- 
nificant development on the analysis front is 
arguably the decision by Affymetrix to release 
previously proprietary chip-design details, 
such as probe sequences, chip -design parame- 
ters and file APIs (applications programming 
interfaces). The goal is to encourage scientists 
to develop innovative analysis tools that can 
potentially derive more biological value from 
GeneChip expression data. The challenge of 
providing a constantly growing and evolving 
body of information associated with arrays 
has been solved in part with a web-based 
tool The company's NEXAFEX web site (see 
online links box) serves as the public portal 
for detailed information on chip design and 
has become a valuable resource for biological 
follow-up of GeneChip expression results. 
Third-party software developers can find 
additional support, including information on 
file APIs, through the Aflymetrbc Developers' 
Network (see online links box). 

Encouraged in part by the openness of the 
platform and spawned by an increase in 
knowledge and experience in array data 
analysis, scientists are developing a number of 
alternative algorithms for probe-set analysis, 
with the goal to derive the best signal that is 
representative of the mRNA level for each 
gene. As each signal is relative to other signals 
in the experiment (both between arrays for 
the same gene and relative to all other genes 
on the array), the process of normalization is 
intimately tied to derivation of signal. The 
more commonly used alternative probe-set 
analysis algorithms include dCHIP 20 , RMA 21 
and ProbeProfiler (Table i ). 

It is outside the scope of this article to dis- 
cuss the nature of the different probe-set 
interpretation and normalization algorithms 
in depth, and the reader is referred elsewhere 72 . 



The algorithms differ in a number of impor- 
tant ways (table i). First, the penalty weight 
that is assigned to the mismatch probe varies 
— MAS 5.0 assigns a relatively heavy penalty 
for cross-hybridization to the mismatch 
probe, RMA assigns no weight and dCHIP 
gives the choice of providing weight or no 
weigh L Second, the ability to discard outlier 
signals varies from package to package, with 
dCHIP and ProbeProfiler having refined 
methods to detect outliers at each level of 
analysis (probe, probe set and microarray). 
These packages are able to replace deviant 
probes with expected data based on the 
remainder of the probe set, and/or flag abnor- 
mal probe sets and arrays for possible exclu- 
sion from further data analysis. Third, the 
method of normalization varies from within a 



f \ . .robust feature selection 
for the purpose of diagnosis 
and molecular markers in 
clinical trials requires robust 
statistical methods. . 



single array (MAS 5.0) to a project-based nor- 
malization (dCHIP, RMA and ProbeProfiler). 
Finally, MAS 5.0 provides a detection p value, 
in which a number is assigned to the confi- 
dence of the signal in question. This can be 
used to weight different probe-set signals in 
subsequent data interpretations. 

The output of all packages is a normalized 
signal (with or without an associated detec- 
tion p value) for each probe set on each array. 
These signals arc then fed into data interpre- 
tation packages for statistical analyses and 
data visualizations. 

Different probe-set interpretation algo- 
rithms lead to different results. Members of 
the Working Group often encounter -50% 



concordance in general data output in their 
own work between comparisons of two dif- 
ferent algorithms. However, it is crucial to 
note that the Urge majority of discordant data 
lies in regions of relatively poor signal/noise 
ratios, and concordance deteriorates in exper- 
iments with high levels of confounding noise. 
In general, the programmes that put less 
weight on the mismatch show better sensitiv- 
ity (linearity) when signals are noisy (table i). 
However, this increased sensitivity can come 
at a cost of substantial contaminating noise 17 . 

The Working Group recommends using at 
least two probe-set algorithms for compari- 
son and prioritization of gene selection (for 
example, MAS 5.0 and the dCHIP difference 
model). 

Data interpretation. Most published microar- 
ray papers could be considered data-poor in 
terms of replicates and systematic statistical 
analyses, but data-rich both in terms of 
amount of high-quality data generated and 
significant research findings. Below, we point 
out the most appropriate current bioinfor- 
matics methods and additional methods that 
require further development so that data can 
be more fully mined for information content 

A second general backdrop to the follow- 
ing discussion is that data visualization is 
one of the most powerful data interpretation 
tools, yet it rarely obeys statistical principles. 
The resolution of the human eye, coupled 
with the abstract computational power of the 
human brain, lies behind the popularity of 
hierarchical clustering and other non-statistical 
principles and visualization methods. How- 
ever, the eye and brain are poorly suited to 
spontaneously deriving statistical support 

There are two general types of experimen- 
tal design that lend themselves to different 
types of statistical and visual analysis: the 
cross-sectional study and the time-series study. 
The cross-sectional study typically has gene or 
pattern selection as the goal the identification 
of one or more genes or patterns of expression 
that arc diagnostic of the condition or state 
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under study. This 'gene selection* might be for 
truly diagnostic purposes (for example, differ- 
ential diagnosis of leukaemia), or might be 
intended to identify relevant biochemical 
pathways. In both cases, the gene or pattern 
selection must be robust, usually implying a 
statistically principled approach, with subse- 
quent validation by predictive computer mod- 
elling (internal cross-validation) or, preferably, 
prospective validation on new data- 
Feature selection can be the main limiting 
factor in evaluation of the predictive perfor- 
mance of an analysis method when there are 
many predictors to select from. This was a 
'mantra* for some of the senior statisticians 
involved in predictive modelling with gene- 
expression array data for several years, but 
only now do the non-statistical users and 
developers of predictive models from non- 
statistical perspectives begin to appreciate 
these issues. Proper validation of any model or 
algorithm that relies on explicit feature selec- 
tion — such as choosing a subset of 70 genes 
from 20,000 — that underlies the resulting 



prediction simply must ensure that the analy- 
sis is tested by internal cross-validation that 
includes feature re-selection as part of the vali- 
dation 23,2 *. The Working Group acknowl- 
edged that prospective validation of any 
findings using new data is the acid test of 
predictive performance. The focus on fea- 
ture or gene selection is vitally important 
when microarrays are used for differential 
diagnosis and has been best studied in cancer 
biopsy/tissue studies. 

An increasing proportion of micro array 
studies focus on delineation of biochemical 
pathways that are modulated in response 
to some stimulus. In practice, these studies 
typically use feature selection to identify 
potential pathways that arc involved in the 
response of the cells or tissues, validation 
is then done on the identified biochemical 
pathways of interest, using mRNA (real-time 
PCR) or protein studies, often proving cause 
and effect in experimental models. 

The Working Group notes that robust 
feature selection for the purpose of diagnosis 



and molecular markers in clinical trials 
requires robust statistical methods, as out- 
lined below, and the burden of proof lies 
with statistical validation. For microarray 
experiments designed to delineate biochem- 
ical pathways, feature selection is used for 
generating a hypothesis and the burden of 
proof of the hypothesis lies with laboratory- 
based research, often at the protein level. 

For feature selection, the Working Group 
recommends that users experiment with 
various statistical methods (such as standard 
parametric tests, nonparametric methods, 
false discovery rate and related methods 25 , 
global or local shrinkage of raw signal inten- 
sities and Stanford's 'nearest shrunken cen- 
troids'"). Developments related to survival 
data analysis are receiving increased attention 
because clinical trials will raise the need to 
move that way. As a corollary, analysis meth- 
ods that focus on signatures of groups of 
genes (such as averages of clusters, Duke's 
metagenes 27 -* 9 and Stanford's eigengenes 30 ) 
seem worth stressing in predictive contexts. 



glossary 

A-, 8- AND C- SERIES ARRAYS 

A series of human, rat and mouse Afiymetrix arrays 
released in 2003, in which the A array contained the 
best-characterized genes, and B and C arrays contained 
less well-defined expressed sequence tags. In 2004. all 
probe sets have been condensed so that there is only 
one microarray per species that covers the entire 
genome. 

CKOSS-SECnONAL DESIGN 
The use of different subjects in an experimental and 
control group or groups. The statistical analysis 
compares the median and variation within each 
group relative to the other groups. 

FEATURE 

Typically one dement (spot) on a microarray; 
In spotted cDNA or oligonucleotide arrays, 
features correspond in genes or Iran scripts; in 
Affymetrix arrays, there arc typically 
22 elements per probe set and often multiple 
probe sets per gene, so a feature might refer 
to a single oligonucleotide, a probe pair or 
a probe set, or a gene with multiple probe 
sets. In bioinformatics it is most often 
synonymous with a gene. 

aUORESCENCE-ACnVATEO CELL SORTING 
(FACS). A method whereby dissociated and individual 
living cells are sorted, in a liquid stream, according to the 
intensity of fluorescence that they emit as they pass 
through a laser beam. 

FUJORCPHORE 

A small molecule, or a part of a larger molecule, 
that can be excited by light to emit fluorescence. 

tSCHAEMlA 

The loss ofblood supply, and hence oxygenation, to a 
tissue or cells. 



LASER CAPTURE MICRODISSECTION 
A technique in which individual cells, or regions of tissue, 
are excised from ahistoIogjcaJ preparation, using specially 
equipped microscopes, and isolated for further study. 

LONGITUDINAL DESIGN 

The use of multiple samples from the same subject With 
this design. each subject serves is their own control, 
eliminating confounding inter-individual variations at 
baseline; paired Meats are used to interpret the data. 

NEGATIVE CELL ISOLATION 
The use of antibodies or other reagents to remove all 
unwanted cells from a mixed population of cells. In this 
method, the desired ceUs are not exposed to bound 
antibodies, thereby avoiding potential activation or other 
molecular alteration in the desired cells. 

PENALTY WEIGHT 

In Affymetrix arrays, hybridization to the 'mismatch* 
probe of a probe pair might or might not be considered 
as a form of measurement of noise or background, and 
can be factored into the sign al seen with the paired 
•perfect match* as a penalty weight 

PHOTOUTHOG RAPHY 

The process of using light to cither etch or activate 
regions of a surface (substrate). This method is used in 
microelectronics to create integrated circuiu and 
processors, 

REAL-TIME PCR 

The quantification of the amount of PCR product 
during each cyde of a PCR reaction. The product 
concentration, as a function of cycle number, provides 
a good estimation of the relative quantity of the 
mRNA being tested. 

RESECTION 

Surgical removal of tissue, most commonly used for 
removing tumorous masses from surrounding tissue. 



St NUCLEASE PROTECTION 
An experimental method for determining mRNA 
transcript concentration in a tissue or cell RNA sample. It 
involves using labelled DNA probes that bind the RNA, 
with overhanging non- hybridized laib of the probe then 
being digested by the 51 nuclease. This creates a smaller 
labelled DNA probe that is indicative of the abundance of 
the mRNA being tested. 

SURVIVAL DATA ANALYSIS 

A battery of suusucal methods applied to data when 
mortality is often the only, or best, measured outcome. 

TIME-SERIES STUDY 

The use of a series of samples taken at defined time 
points after a defined stimulus. In mice and rats, the 
samples at different time points are usually from 
different animals. In humans, time-aeries studies are 
necessarily longitudinal to avoid additional confounding 
noise. 

TUKETS BI- WEIGHT ESTIMATOR 
Many statistical tests require underlying definitions 
that are assumed to be valid (for example, tumour 
versus non-tumour), and require data that show a 
normal distribution. Microarray data, and the 
clinical information underlying the definition of 
samples, is often less exact, with genes or samples 
often performing as statistical outliers. Tukey's bi- 
weight estimator is one of the M- class of statistical 
models that is less sensitive to outliers and performs 
more gracefully when underlying assumptions are 
inexact. 

WI1COXON-S SIGNED RANK 
A statistical test drat investigates the population 
median of paired differences. It is wdl suited for 
microarray work as it treats each gene as an 
independent variable and does not require normal 
distributions of the data. 
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Rgure 2 1 Dense time-series data with adequate repOcates can provide robust visual 
Interpretation of data. Dynamic single-gene queries can provide visually compelling results 
and avoid many issues that compScale statistical analyses of cross-sectional mfcroarray studies. 
Dynamic queries of the go/7 transcript probe set from a tlrne-sertes study Is shown. "The*- axis represents 
time fin hours) after rats were given a bolus of 3-matrvyJprecWsdorie. They-axte represents relative 
expression level. InrfvWual rats were stucfed at each time point (green data points), with both liver (panel a) 
and muscle (panel b) tissue taken from the same animals. Averages of the repOcates for each data point are 
shown fn magenta); the graph Bne is drawn between the averages, fii basefine (time 0). the gotl transcript 
has a normafized signal of about 2,400 in Dver and 1 ,200 in musde. The gene is dearly responsive to 
3-methyiprecfnisdone in liver (panel a), where expression rapidly increases within the first hour, plateaus 
between 1-7 nr. then qiricWy faDs back to baseline by 12 nr. The recreates appear relatively consistent 
By contrast, the same gene in musde does not seem to respond to the drug; the variability in replicates 
is larger than any temporally-relevant change. Data from http y/nrMcroarray.cnrri cresearch .org/ 
sindegenemaiaasp and RF.fs 632. 



Whatever the specific statistical model that is 
applied for prediction, using aggregate gene 
expression has important consequences: 
measures of aggregation of expression over a 
group of genes with related profiles can 
reduce dimension (thereby mitigating the 



curse of dimensionality), This can reduce 
multiplicities and, to some degree, ease the 
problems of gene selection, multiple testing 
and co -linearity, while improving signal 
estimation by averaging correlated noise 
components. 



Data visualizations, time series and candidate 
geties.Thc above discussions of biostatistics 
all assume that the analysis is targeted towards 
a cross- sectional study, in which the primary 
goal is diagnostic gene discovery (gene or fea- 
ture selection). In other words, a series of 
microarrays with a very large number of tran- 
scripts defines the very small minority of 
genes that are correlated and therefore predic- 
tive of the biological variable of interest. 
There are alternatives to this standard experi- 
mental design that use entirely different types 
of analysis, and the statistical issues are also 
quite different, as explained below. 

The rime-series study, if done with enough 
time points, can provide an effective antidote 
to the curse of dimensionality — the action of 
any gene during a time-series study should 
make biological sense, such that each signal is 
relatively easily discernible from noise. Visual 
query of a large time-series data set for single 
gene responses to the controlled variable either 
might meet expectations and is therefore valid* 
or might not meet expectations and is dis- 
carded as uninteresting. As an example, we 
show a time-series study in which rats are given 
a bolus of methyiprednisolone, after which 
their liver and muscle are studied as a function 
of time (fig. 2). In this case, the same gene (gori) 
is queried using a web-based dynamic visual- 
ization tool, first in liver (FIG. 2a) and then in 
muscle (fig. 2b). The data in the top panel are 
visually compelling; gotl in liver responds 
quickly and strongly to a bolus of 3-mcthyl- 
prednisolone, with relatively consistent repli- 
cates (each data point comes from a different 
animal) and a time course that is visually assur- 
ing so that complex statistical tests of the tran- 
scriptional response as a function of rime are 
not needed. On the other hand, the same gene 
in musde does not seem to respond to the 
drug* 31 (FIG. 2b). Through such gene queries, 
the variability in replicates and the appropri- 
ateness of the action of the gene as a function 
of time can quickly be assessed. Another 
advantage of time-series data is that such pro- 
files act as biomarkers that are amenable to 
analysis and interpretation using pharmacody- 
namic models that predict the underlying 
mechanisms of control of gene expression 31 . 

The Working Group agreed that data-rich, 
time-series experimental designs provide 
some latitude in reporting significant findings 
and that the query of individual genes within 
large data sets can circumvent complex issues 
of multidimensional^ of data. 

Future areas of development 

The data-rich and highly dimensional nature of 
microarray data serves as a model for future 
dissection and understanding of biological 
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systems in general, including protcomics and 
integration ofmRNA profiling and proteomics. 
The Working Group discussed data analysis 
needs within the microarray community and 
agreed that> along with the incorporation of 
QC SOPs and optimized or customized sig- 
nal/noise analyses in initial project signal gener- 
ation, the back-end statistics needs to reach a 
commonly accepted method of dealing with the 
curse of dimensionality be fore microarrays can 
be reliably used in clinical trials. Statisticians 
need to focus more on representation of pre- 
diction results in terms of probabilities and 
associated measures of uncertainty, and reach a 
consensus on what is acceptable. In the mean- 
time, it is likely that specific marker or diagnos- 
tic genes will be extracted from pQot profiling 
studies, and then only this small subset of genes 
will be used as a clinical trial endpoint This data 
limitation approach removes much of the curse 
of dimensionality, but is liable to ignore the 
large majority of data, thereby decreasing the 
potential power of the study and bringing into 
question the use of rnkroarrays in dtnical trials. 

A move towards the standardization of 
reporting of prediction accuracy would be 
desirable when assessing predictive accuracy 
through within-sample cross-validation. The 
Working Group suggests that one or more val- 
idation techniques be used when reporting 
predictive genes: leave-one-out and 10% 
cross-validation summaries, or true validation 
data sets. Communicating uncertainty about 
predictive performance is also key and will 
help evaluate results based on varying sample 
sizes. The Working Group suggests that until 
this information is routinely presented in pub- 
lished papers, it will be difficult to reach an 
acceptable consensus for use in clinical trials. 

Conclusions 

There arc four key areas of optimization and 
standardization that are largely independent: 
study design, technical variability (QC/SOP of 
data generation), analysis method variation 
(signal/noise optimization using probeset 
algorithms and normalizations) and back-end 
statistical analyses. Statistics of clinical trial 
design is crucial: gene -expression data does 
not mitigate the need for sound and relevant 
design and analysis, nor does it challenge what 
we know about design. The field is quickly 
maturing from the small-chip-number hit- 
and-run type projects to those with a more 
robust study design. However, study design 
depends ultimately on appropriate powering 
of a study, which is greatly affected by both 
the chip-analysis algorithms that are used and 
the biostatistical data analysis. 

Development of back-end statistical meth- 
ods for data representation/summary and for 



high-level analysis remains an active area of 
research for both academic and commercial 
users, and is likely to remain so in the near 
future. We are some way from defining stan- 
dards of summary signal intensities alone 
and even further from considerations of 
standardization of analytical methods for 
inference and prediction in clinical contexts. 
In regulated clinical studies, such standards 
will be enforced partly by the US FDA as sub- 
missions of medical test/device protocols 
emerge and increase in number. Even then, 
however, many approaches to data analysis 
and modelling will be used and developed, 
which is, of course, to be supported. It is very 
difficult to influence the research commu- 
nity, especially when the variety of problems 
that are encountered promotes the need for 
refined and new approaches. 
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Figure 6-3 Genes can be expressed 
with different efficiencies* Gene A is 
transcribed and translated much more 
efficiently than gene B.Thte allows the 
amount of protein A In the cell to be 
much greater than that of protein B, 
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FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA 

Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA — 
the language of a nucleotide sequence. Hence the name transcription. 

like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds [Figure 6M). It differs from 
DNA chemically in two' respects: (1) the nucleotides in RNA are 
ribonucleotides — that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine {Q, it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like X can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA On RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

AH of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease, (A) Schematic Illustration of the type of 
conformational change in a protein that produces material for a crow-beta filament. (B) Diagram illustrating 
the self-Infectious nature of the protein aggregation that is central to prion diseases. PrP is highly unusual 
because the mtsfolded version of the protein, called PrP*, Induces the normal PrP protein it contacts' to 
change its conformation, as shown. Most of the human cfiseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure Is 
not Infectious In this way, it cannot spread from one animal to another. (Q Drawing of a cross-beta filament, 
a common type of protease-reslstant protein aggregate found In a variety of human neurological diseases. 
Because the hydrogen-bond interactions In a p sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP* showing the likely change of two o>heliees into four 
p-sVands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form is not yet known with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise SerpeU. adapted from M. Sunde et al., J. AloJ. Biol 273:729-739, 
1 997; O, adapted from SlB. rVusiner, Trends Bfocnem. Sol 21 :482-487, 1 996.) 

■ 

animals and humans. It can be dangerous to eat die tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as die "mad cow disease") from cattle to humans in Great Britain, 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance 1 ' observed 
in yeast cells. 

There Are Many Steps From DN A to Protein 

We have seen so fax in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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Figure 6-90 The production of a 
protein by a eucaryotic eel L The f ma | 
level of each protein in a eucaryotic cell 
depends upon the efficiency of each step 
depicted. 



ure 6-90) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is die most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step— the transcription of its DNA sequence into an RNA 
molecule. 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par* 
ttcular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code* 

7b initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal submit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs—each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by firming 
complementary base pa irs with the tRNA an Ucodon. Bach amino add is added to the 
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
cucsuyottc gene expression can be 
controlled. Controls that operate at 
steps I through 5 are discussed In this 
chapter. Step 6, the regulation of protein 
activity, Includes reversible activation or 
Inactlvatlon by protein phosphorylation 
(discussed In Chapter 3) as well as 
irreversible inactlvation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
In the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism dep end on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leadfog from DNA to protein, and all of them can in principle be regulated Tlius 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosoi 
and detennining where in the cytosol they are localized (RNA transport and 
locaUzatton control), (4) selectingwhich mRNAs in the cytoplasm are translated 
byribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures mat the cell win not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
e^eStoa ^ ° f diapter tD me additional ways <* regulating gene 

Summary 

^Seno^ofajea contains in its Dm sequence the information to make many 
thousands ofdifferent protein and RNA molecules. A cell typically expresses only 
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fraction of its genes, and the different types ofcellsin multicellular organisms arts, 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environmen % suck as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
ciple be regulated, for most genes the initiation of UNA transcription is the most 
.important point of control 

DNA-BIND1NG MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
^mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
raued by aregulatory region of DNA relatively near the site where transcription 
.oegais. Some regulatory regions are simple and act as switches that are thrown 
oy a single signal. Many others are complex and act as tiny microprocessors, 
.responding to a variety of signals that they interpret and integrate to switch the 
«eignbormg gene on or off. me^ 
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occur In the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods- of evolutionary time, the methylated CG sequences in these inactive . 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired- However promoters of genes that remain active 
in the germ cell lineages " (including most housekeeping genes) are kept 
unmethyiated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired. Such regions are preserved in modern day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

pie mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different ceils. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
ceUdivides. These features endow the cell with a memory of its developmen tal history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other, this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in ah igher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. ' 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells tb regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 
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Figure 7-86 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
Into CG islands in .vertebrate 
genomes. A fefacfc Une marks the location 
of a CG dinudeotide In the DNA 
sequence, while a red "lollipop" indicates 
the presence of a methyl group on the 
CG dinudeotide. CG sequences that lie In 
regulatory sequences of genes that are 
transcribed in germ cells are unmethyiated 
and therefore tend to be retained in 
evolution. Methylated CG sequences, on 
the other hand, tend to be lost through 
deamination of 5-methyi C to X unless the 
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are die predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these p^sttranscripdonal 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control, for 
many genes they are crucial 
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Differential Tissue Expression Distribution" which is described in EXAMPLE 18 in the 
specification that were used to identify differences in gene expression between tumor tissue and 
their normal counterparts. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

4. In differential gene expression studies, one looks for genes whose expression levels 
differ significantly under different conditions, for example, in normal versus diseased tissue. 
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Chromosomal aberrations, such as gene amplification, and chromosomal translocations are 
important markers of specific types of cancer and lead to the aberrant expression of specific 
genes and their encoded polypeptides, including over-expression and under-expression. For 
example, gene amplification is a process in which specific regions of a chromosome are 
duplicated, thus creating multiple copies of certain genes that normally exist as a single copy. 
Gene under-expression can occur when a gene is not transcribed into mRNA. In addition, 
chromosomal translocations occur when two different chromosomes break and are rejoined to 
each other chromosome resulting in a chimeric chromosome which displays a different expression 
pattern relative to the parent chromosomes. Amplification of certain genes such as Her2/Neu 
[Singletons al, Pathol. Annu. . 27Ptl: 165-190], or chromosomal translocations such as t(5;14), 
[Grimaldi et aL Blood 73(8):2081-2085(1989); Meeker et al y Blood 76(2):285-289(1990)] give 
cancer cells a growth or survival advantage relative to normal cells, and might also provide a 
mechanism of tumor cell resistance to chemotherapy or radiotherapy. When the chromosomal 
aberration results in the aberrant expression of a mRNA and the corresponding gene product (the 
polypeptide), as it does in the aforementioned cases, the gene product is a promising target for 
cancer therapy, for example, by the therapeutic antibody approach. 

5. Comparison of gene expression levels in normal versus diseased tissue has 
important implications both diagnostically and therapeutically. For example, those who work in 
this field are well aware that in the vast majority of cases, when a gene is over-expressed, as 
evidenced by an increased production of mRNA, the gene product or polypeptide will also be 
over-expressed. It is unlikely that one identifies increased mRNA expression without associated 
increased protein expression. This same principle applies to gene under-expression. When a 
gene is under-expressed, the gene product is also likely to be under-expressed Stated in another 
way, two cell samples which have differing mRNA concentrations for a specific gene are 
expected to have correspondingly different concentration of protein for that gene. Techniques 
used to detect mRNA, such as Northern Blotting, Differential Display, in situ hybridization, 
quantitative PGR, Taqman, and more recently Microarray technology all rely on the dogma that a 
change in mRNA will represent a similar change in protein. If this dogma did not hold true then 
these techniques would have little value and not be so widely used. The use of mRNA 
quantitation techniques have identified a seemingly endless number of genes which are 
differentially expressed in various tissues and these genes have subsequently been shown to have 
correspondingly similar changes in their protein levels. Thus, the detection of increased mRNA 
expression is expected to result in increased polypeptide expression, and the detection of 
decreased mRNA expression is expected to result in decreased polypeptide expression. The 
detection of increased or decreased polypeptide expression can be used for cancer diagnosis and 
treatment. 

6. However, even in the rare case where the protein expression does not correlate 
with the mRNA expression, this still provides significant information useful for cancer diagnosis 
and treatment. For example, if over- or under-expression of a gene product does not correlate 
with over- or under-expression of mRNA in certain tumor types but does so in others, then 
identification of both gene expression and protein expression enables more accurate tumor 
classification and hence better determination of suitable therapy. In addition, absence of over- or 
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under-expression of the gene product in the presence of a particular over- or under-expression of 
mRNA is crucial information for the practicing clinician. For example, if a gene is over-expressed 
but the corresponding gene product is not significantly over-expressed, the clinician accordingly 
will decide jiot to treat a patient with agents that target that gene product 

7. I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information or belief are believed to be true, and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 1 8 of the United States 
Code and that such willful statements may jeopardize the validity of the application or any 
patent issued thereon. 
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DECLARATION OF PAUL POLAKIS, Ph.D, 
L. Paul Polakis, Ph.D., declare and say as follows: )' 

1. I was awarded a PhD by the Department of Biochemistry of the Michigan 
State University in 1 984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins, that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment Of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
whjch has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date we 
have generated antibodies that bind to about 30 of the tumor antigen/proteins ' ' 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA. in that cell type. In approximately 80% of our . 
observations we have found mat increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 

. to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the. encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful mlse 
statements and the like so made are punishable by fine or imprisonment, or both 
under Section 1 00 1 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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^tracts. If these minor cell proteins differ among cells to the same extent as the 
?moie abundant proteins, as is commonly assumed, only a small number of pro- 
tein di2er ences (perhaps several hundred) suffice to create very large differences 
in cell morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer "present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at ail. These examples illustrate 
a general feature of cell specialization— different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect the persistent expression of different sets of 
genes. 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 4 

If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a cell can control the proteins it 
makes by (1) controlling when and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary RNA transcript is spliced or 
otherwise processed (RNA processing control), (3) selecting which completed 
mRNAs in the cell nucleus are exported to the cytoplasm (RNA transport con- 
trol), (4) selecting which mRNAs in the cytoplasm are translated by ribosomes 
{translational control), (5) selectively destabilizing certain mRNA molecules in 
foe cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figure 9-2). 

For most genes transcriptional controls are paramount. This makes sense 
Because, of all the possible control points illustrated in Figure 9-2, only transcrip- 
onal control ensures that no superfluous intermediates are synthesized. In the 
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Figure 9-2 Six steps at which 
eucaryote gene expression can be 
controlled* Only controls that operate 
at steps 1 through 5 are discussed in 
this chapter. The regulation of protein 
activity (step 6) is discussed in 
Chapter 5; this includes reversible 
activation or inactivation by protein 
phosphorylation as well as 
irreversible inactivation by proteolytic 
degradation. 
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following sections we discuss the DNA and protein components that regulate the 
initiation of gene transcription. We return at the end of the chapter to the other 
ways of regulating gene expression. 



Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction ofits genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation of RNA transcription is the most important 
point of control 



DNA-binding Motifs in Gene 
Regulatory Proteins 5 

How does a cell determine which of its thousands of genes to transcribe? As dis- 
cussed in Chapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site where transcription begins. Some regulatory regions 
are simple and act as switches that are thrown by a single signal. Other regula- 
tory regions are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret and integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins that recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
proteins were discovered. 

Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics 6 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda. The repressor shuts off the viral genes that code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80). The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic en2ymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent from the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in protfeins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts on a series of stan- 
dard chromatography columns (see pp. 166-169). Once isolated, the pro- 
teins were shown to bind to specific DNA sequences close to the genes that they 
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Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to he transcribed in different cells. Since many spe- 
cialized animal cells can maintain their unique character when grown in culture, the 
gene regulatory mechanisms involved in creating them must be stable once estab- 
lished and heritable when the cell divides, endowing the cell with a memory of its 
developmental history. Procaryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized cell types In higher eucaryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other, this can create a flip-flop 
switch that switches a cell between two alternative patterns of gene expression. Di- 
rector indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory. 

In eucaryotes gene transcription is generally controlled by combinations of gene 
regulatory proteins. It is thought that each type of cell in a higher eucaryotic organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of cell A given gene regulatory pro- 
tein may be expressed in a variety of circumstances and typically is involved in the 
regulation of many genes. 

in addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utilized by eucaryotic cells to regulate gene expression. In ver- 
tebrates DNA methylation also plays a part, mainly as a device to reinforce decisions 
about gene expression that are made initially by other mechanisms. 



Posttranscriptional Controls 

Although controls on the initiation of gene transcription are the predominant 
form of regulation for most genes, other controls can act later in the pathway 
from RNA to protein to modulate the amount of gene product that is made. Al- 
jtougfr these posttranscriptional controls, which operate after RNA polymerase 
jjas bound to the gene's promoter and begun RNA synthesis, are less common 
^transcriptional control, fox many genes they are crucial. It seems that every 
step in gene expression that could be controlled in principle is likely to be regu- 
ated under some circumstances for some genes. 

We consider the varieties of posttranscriptional regulation in temporal or- 

«j according to the sequence of events that might be experienced by an RNA 

°tecul e after its transcription has begun (Figure 9-72). 
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FROM DNATO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, ceils can synthesize a large amount of 
protein rapidly when necessary But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment — most obviously by controlling 
the production of its RNA. 



Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A is 
transcribed and translated much mare 
efficiently than gene B.This allows the 
amount of protein A in the cell to be 
much greater than that of protein B. 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in two' respects: (1) the nucleotides in RNA are 
ribonucleotides — that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deojcyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6) . As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 



302 Chapter 6 : HOW CELLS READ THE GENOME: FROM DNATO PROTEIN 




Figure 6-89 Protein aggregates that cause human disease. (A) Schematic illustration of the type of 
conformational change in a protein that produces material for a cross-beta filament. (B) Diagram illustrating 
the self-infectious nature of the protein aggregation that Is central to prion diseases. PrP is high!/ unusual 
because the misfolded version of the protein, called PrP*, induces the norma! PrP protein it contacts to 
change its conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure is 
not infectious in this way, it cannot spread from one animal to another. (C) Drawing of a cross-beta filament, 
a common type of protease- resistant protein aggregate found in a variety of human neurological diseases. 
Because the hydrogen-bond interactions in a (3 sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP*, showing the likeJy change of two a-helices into four 
p-strands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form Is not yet known with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpeli, adapted from M. Sunde et al J. Mo/. Bfol 273:729-739, 
1 997; D, adapted from S.B. Prusiner, Trends fitocnem. Set. 21 :482-487, 1 996.) 

animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease") from cattle to humans in Great Britain, 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance" observed 
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level of each protein in a eucaryotic cell 
depends upon the efficiency of each step 
depicted. 
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ure 6-90) couid be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step— the transcription of its DNA sequence into an RNA 
molecule* 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs — each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anticodan. Bach amino acid is added to the 
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled. Controls that operate at 
steps I through 5 are discussed ip this 
chapter. Step 6, the regulation of protein 
activity, includes reversible activation or 
inactlvation by protein phosphorylation 
(discussed in Chapter 3) as well as 
irreversible inactivation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and detenriining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translauonal control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 



Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
ciple be regulated, for most genes the initiation of RNA transcription is the most 
. important point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
trolled by a regulatory region of DNA relatively near the site where transcription 
begins. Some regulatory regions are simple and act as switches that are thrown 
oy a single signal. Many others are complex and act as tiny microprocessors, 
.xesporiding to a variety of signals that they interpret and integrate to switch the 
•gnboring gene on or off. Whether complex or simple, these switching devices 
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occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive . 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters of genes that remain active 
in the germ cell lineages (including most housekeeping genes) are kept 
unmethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired Such regions are preserved in modern day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Director indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis far cellular clocks. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also Junctions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 
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Figure 7-86 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in vertebrate 
genomes. A black line marks die location 
of a CG dinucleotide in the DNA 
sequence, while a red "lollipop** indicates 
the presence of a methyl group on the 
CG dinucleotlde. CG sequences that lie in 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained In 
evolution. Methylated CG sequences, on 
the o^her hand, tend to be lost through 
deamination of 5-methyl C toT, unless the 
CG sequence is critical for survival. 



POSTTRANSCRI PTION AL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can rind examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predorninant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these posttranscriptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control, for 
rrtany genes they are crucial. 
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CHAPTER 29 

Regulation of transcription 
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j| T e phenotypic dilTcrences that distinguish the 
various kinds of cells in a higher eukaryole are 
largely due to differences in the expression of 
?e nes that code Tor proteins, thai is. those tran- 
scribed by RNA polymerase II. In principle, the 
expression or these genes might be regulated at 
a ny one of several stages. The concept of the 
-level or control" implies thai gene expression 
is not necessarily an automatic process once it 
has begun. It could he regulated in a gene- 
specific way at any one or several sequential 
steps. We can distinguish (at least) live poten- 
tial control points. farming the series: 

Activation of gene stniclure 
•1 

Initiation of transcription 
i 

Processing the Iraiiscrip.i 
X 

Transport, to cytoplasm 
I 

Translation or mKNA 

The existence of Ihe first step is implied by 
the discovery thai genes may exist in cither oV 
two structural conditions. Relative to the state 
or most of the genome, genes are found in 
mi -active" stale in the cells in which they 
are expressed {see Chapter 27). The change of 
structure is. distinct from Hie act or transcript 
lion, and indicates that the gene is nranserib- 
able." This suggests that acquisition or the 
"active" structure must be the first step in gene 
expression. 

Transcription of a gene in the active state is 



controlled at the stage or initiation, that is. by 
the interaction of RNA polymerase with its pro- 
moter. This is now becoming susceptible to 
analysis in the in vitro systems (see Chapter 
2H). For most genes, this* is a major control 
point: probably ii is the most common level of 
regulation. 

There is at present no evidence ror control 
at subsequent stages of transcription in eukary- 
olic cells, ror example, via anlitermination 
mechanisms. 

The primary transcript is modified by capping 
at the 5' end, and usually also by polyadenyla- 
lion at the 3' end. Introns must be spliced out 
from the transcripts or interrupted genes. The 
mature RNA must be exported from. the nucleus 
lo Ihe cytoplasm. Regulation or gene expression 
by selection of sequences at the level of nuclear 
RNA might involve any or all of these stages, 
but the one ror which we have most evidence 
concerns changes in splicing: some genes are 
expressed by means or alternative splicing pat- 
terns whose regulation controls the type or pro- 
tein product (see Chapter 30). 

Finally, the translation of an mRNA in the cyto- 
plasm can be specifically controlled. There is little 
evidence for the employment or this mechanism in 
adull somatic cells, but it does occur in some 
embryonic situations, as described in Chapter 7. 
The mechanism is presumed to involve the block- 
ing or initiation or translation of some mRNAs by 
specific protein factors, 

But having acknowledged that control or gene 
expression can occur at multiple stages, and 
that production or RNA cannot inevitably be 
equated with production or protein, it is clear 
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that the overwhelming majority of regulatory 
events occur at^the^ initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at the heart of eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to provide 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulation: what identifies 
the common target genes to the transcription 
factor; and how is the activity of the transcrif>- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they' share a prvmoter element that is recognized 
by a regulatory- tmnscription factor. An element 
that causes a gene to respond to such a factor 
is called a response element; examples are the 
HSE (heal shock response element), GRE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in. Table 29,1. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
extends for a short distance on either side of 



Table 29-1 Inducible transcription factors bind to 
response elements that identify groups ol promoters 
or enhancers subject to coordinate control. 



Regulatory Agent Module Consensus 



Factor 



Heat shock HSE CNNGAANNTCCNNG HSTF 

Glucocorticoid GRE TGGTACAAATGTTCT Receptor 

Phorbol ester TRE TGACTCA API 

Serum SRE CCATATTAGG SRF 



the consensus sequence. In promoters, the ele- 
ments are not present at fixed distances from 
the startpoint, but are usually <200 bp upstream 
of it. The presence of a single element usually 
is sufficient to confer the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in P* 0 " 
moters or in enhancers. Some types of elements 
are typically found in one rather than the other 
usually an HSE is found in a promoter, while a 
GRE is round in an enhancer. We assume tM 
all response elements function by the san* 
general principle. A gene is regulated by 0 
sequence at the promoter or enhancer that » 
recognized by a specific protein, the pro**** 
Junctions as a transcription factor needed J* 
RNA polymerase to initiate. Active protein 
available only under conditions when the g*** 
to be expressed; its absence means thai the 
moter is not activated by this particular rirt* 1 _ 

An example of a situation in which fl* • 
genes are controlled by a single factor * s J*jj. 
vided by the heal shock response. This is c0 ^ 
mon to a wide range of prokary° leS c1 f 
eukaryotes and involves multiple contro is 
gene expression: an increase in temper 8 ^ 
turns off transcription of some genes, lurnS fll ,d 
transcription of the heal shock 6 eDCS, R s,v- 
causes changes in the translation of & ' !f « 
The control of the heat shock genes M* 5 * ^ 
the differences between prokary otic ^ 
eukaryotic modes of control In. bacteria, 3 ^4 
sigma factor is synthesized that d irectS a |ier 
polymerase holoenzyme to recognize a n - 
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Background: Prostate stem cell antigen (PSCA) is a recently defined homologue of the Thy-l/Ly-6 family of 
gtycosylphosphatidylinositol (GPI)-anchored cell surface antigens. The purpose of the present study was to 
examine the expression status of PSCA protein and mRNA in clinical specimens of human prostate cancer (Pea) 
and to validate it as a potential molecular target for diagnosis and treatment of Pea. 

Materials and Methods: Immunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections from 20 benign prostatic hyperplasia 
(BPH), 20 prostatic intraepithelial neoplasm (PIN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
independent prostate cancers. The level of PSCA expression was semiquantitarJvely scored by assessing both the 
percentage and intensity of PSCA-positive staining cells in the specimens. Then compared PSCA expression 
between BPH, PIN and Pea tissues and analysed the correlations of PSCA expression level with pathological grade, 
clinical stage and progression to androgen-independence in Pea. 

Results: In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative and less intense 
and uniform than that seen in HGPIN and Pea. There were moderate to strong PSCA protein and mRNA 
expression in 8 of 1 1 (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by IHC and ISH analyses, 
with statistical significance compared with BPH (20%) and low grade PIN (22.2%) samples (p < 0.05, respectively). 
The expression level of PSCA increased with high Gleason grade, advanced stage and progression to androgen- 
independence (p < 0.05, respectively). In addition. IHC and ISH staining showed a high degree of correlation 
between PSCA protein and mRNA overexpression. 

Conclusions: Our data demonstrate that PSCA as a new cell surface marker is overexpressed by a majority of 
human Pea. PSCA expression correlates positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and androgen-independence, and 
speculatively with prostate carcinogenesis. PSCA protein overexpression results from upregulated transcription 
of PSCA mRNA. PSCA may have prognostic utility and may be a promising molecular target for diagnosis and 
treatment of Pea. 
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Introduction 

Prostate cancer (Pea) is the second leading cause of can- 
cer-related death in American men and is becoming a 
common cancer increasing in China. Despite recently 
great progress in the diagnosis and management of local- 
ized disease, there continues to be a need for new diagnos- 
tic markers that can accurately discriminate between 
indolent and aggressive variants of Pea. There also contin- 
ues to be a need for the identification and characterization 
of potential new therapeutic targets on Pea cells. Current 
diagnostic and therapeutic modalities for recurrent and 
metastatic Pea have been limited by a lack of specific tar- 
get antigens of Pea. 

Although a number of prostate-specific genes have been 
identified (i.e. prostate specific antigen, prostatic acid 
phosphatase, glandular kallikrein 2), the majority of these 
are secreted proteins not ideally suited for many immuno- 
logical strategies. So, the identification of new cell surface 
antigens is critical to the development of new diagnostic 
and therapeutic approaches to the management of Pea. 

Reiter RE et al [1] reported the identification of prostate 
stem cell antigen (PSCA), a cell surface antigen that is pre- 
dominantly prostate specific. The PSCA gene encodes a 
123 amino acid glycoprotein, with 30% homology to 
stem cell antigen 2 (Sea 2). Like Sca-2, PSCA also belongs 
to a member of theThy-l/Ly-6 family and is anchored by 
a glycosylphosphatidylinositol (GPI) linkage. mRNA in 
$itu hybridization (ISH) localized PSCA expression in nor- 
mal prostate to the basal cell epithelium, the putative 
stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor 
cells. 

In order to examine the status of PSCA protein and mRNA 
expression in human Pea and validate it as a potential 
diagnostic and therapeutic target for Pea, we used immu- 
nohistochemistry (IHC) and in situ hybridization (ISH) 
simultaneously, and conducted PSCA protein and mRNA 
expression analyses in paraffin-embedded tissue speci- 
mens of benign prostatic hyperplasia (BPH, n = 20), pros- 
tate intraepithelial neoplasm (PIN, n = 20) and prostate 
cancer (Pea, n = 48). Furthermore, we evaluated the possi- 
ble correlation of PSCA expression level with Pea tumori- 
genesis, grade, stage and progression to androgen- 
independence. 

Materials and methods 
Tissue samples 

All of the clinical tissue specimens studied herein were 
obtained from 80 patients of 57-84 years old by prostate- 
ctomy, transurethral resection of prostate (TURP) or biop- 
sies. The patients were classified as 20 cases of BPH, 20 
cases of PIN, 40 cases of primary Pea, including 9 patients 



with recurrent Pea and a history of androgen ablation 
therapy (orchiectomy and/or hormonal therapy), who 
were referred to as androgen-independent prostate can- 
cers. Eight specimens were harvested from these andro- 
gen-independent Pea patients prior to androgen ablation 
treatment. Each tissue sample was cut into two parts, one 
was fixed in 10% formalin for IHC and the other treated 
with 4% paraformaldehyde/0.1 M PBS PH 7.4 in 0.1% 
DEPC for 1 h for ISH analysis, and then embedded in par- 
affin. All paraffin blocks examined were then cut into 5 
um sections and mounted on the glass slides specific for 
IHC and ISH respectively in the usual fashion. H&E- 
stained section of each Pea was evaluated and assigned a 
Gleason score by the experienced urological pathologist at 
our institution based on the criteria of Gleason score [2]. 
The Gleason sums are summarized in Table 1. Clinical 
staging was performed according to Jewett-whitmore- 
prout staging system, as shown in Table 2. In the category 
of PIN, we graded the specimens into two groups, i.e. low 
grade PIN (grade I - II) and high grade PIN (HGPIN, 
grade III) on the basis of literatures [3,4). 

tmmunohistochemicat (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, 
and subjected to microwaving in 10 mmol/L citrate 
buffer, PH 6.0 (Boshide, Wuhan, China) in a 900 W oven 
for 5 min to induce epitope retrieval. Slides were allowed 
to cool at room temperature for 30 min. A primary mouse 
antibody specific to human PSCA (Boshide, Wuhan, 
China) with a 1:100 dilution was applied to incubate with 
the slides at room temperature for 2 h. Labeling was 
detected by sequentially adding biotinylated secondary 
antibodies and strepavidin-peroxidase, and localized 
using 3,3'-diaminobenzidine reaction. Sections were then 
counterstained with hematoxylin. Substitution of the pri- 
mary antibody with phosphate-buffered-saline (PBS) 
served as a negative-staining control. 

mRNA in situ hybridization (ISH) 

Five-um-thick tissue sections were deparaffinized and 
dehydrated, then digested in pepsin solution (4 mg/ml in 
3% citric acid) for 20 min at 37.5°C, and further proc- 
essed for ISH. Digoxigenin-labeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, 
Wuhan, China) were hybridized to the sections at 48 °C 
overnight. The posthybridization wash with a high strin- 
gency was performed sequentially at 37° C in 2 x standard 
saline citrate (SSC) for 10 min, in 0.5 * SSC for 15 min 
and in 0.2 x SSC for 30 min. The slides were then incu- 
bated to biotinylated mouse anti-digoxigenin antibody at 
37.5 °C fori h followed by washing in 1 x PBS for 20 min 
at room temperature, and then to strepavidin-peroxidase 
at 37.5 °C for 20 min followed by washing in I x PBS for 
15 min at room temperature. Subsequently, the slides 
were developed with diaminobenzidine and then coun- 
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Intensity x frequency 



Gleason score 0-6 (%) 9 (%) 



2-4 5(83) } (IT) 

5- 7 19(79) 5(21) 

6- I0 5(28) 13(72) 



Table 2: Correlation of PSCA expression with clinical stage 



Intensity * frequency 



Tumor stage 


0-6 (%) 


9(%) 


£8 


27 (67.5) 
2(25) 


13(32.5) 
6(75) 





terstained with hematoxylin to localize the hybridization 
signals. Sections hybridized with the sense control probes 
routinely did not show any specific hybridization signal 
above background. All slides were hybridized with PBS to 
substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA 
immunostaining and mRNA in situ hybridization, the 
same scoring manners are taken in the present study for 
PSCA protein staining by IHC and PSCA mRNA staining 
by ISH. Each slide was read and scored by two independ- 
endy experienced urological pathologists using Olympus 
BX-41 light microscopes. The evaluation was done in a 
blinded fashion. For each section, five areas of similar 
grade were analyzed semiquantitativeiy for the fraction of 
cells staining. Fifty percent of specimens were randomly 
chosen and rescored to determine the degree of interob- 
server and intraobserver concordance. There was greater 
than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopi- 
cally was graded on a scale of 0 to 3+ with 3 being the 
highest expression observed (0, no staining; 1+, mildly 
intense; 2+, moderately intense; 3+, severely intense). The 
staining density was quantified as the percentage of cells 
staining positive for PSCA with the primary antibody or 
hybridization probe, as follows: 0 = no staining; 1 = posi- 
tive staining in <25% of the sample; 2 « positive staining 
in 25%-50% of the sample; 3 = positive staining in >50% 



of the sample. Intensity score (0 to 3+) was multiplied by 
the density score (0-3) to give an overall score of 0-9 
[1,5 J. In this way, we were able to differentiate specimens 
that may have had focal areas of increased staining from 
those that had diffuse areas of increased staining [6]. Hie 
overall score for each specimen was then categorically 
assigned to one of the following groups: 0 score, negative 
expression; 1-2 scores, weak expression; 3-6 scores, mod- 
erate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expres- 
sion in BPH, PIN and Pea tissues were compared using the 
Chi-square and Student's t-test. Univariate associations 
between PSCA expression and Gleason score, clinical 
stage and progression to androgen-independence were 
calculated using Fisher's Exact Test. For all analyses, p < 
0.05 was considered statistically significant. 

Results 

PSCA expression In BPH 

In general, PSCA protein and mRNA were expressed 
weakly in individual samples of BPH. Some areas of 
prostate expressed weak levels (composite score 1-2), 
whereas other areas were completely negative (composite 
score 0). Four cases (20%) of BPH had moderate expres- 
sion of PSCA protein and mRNA (composite score 4-6) 
by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA 
mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 
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2) in one and negative (composite score 0) in the other. 
PSCA expression was localized to the basal and secretory 
epithelial cells, and prostatic stroma was almost negative 
staining for PSCA protein and mRNA in ail cases 
examined. 

PSCA expression in PIN 

In this study, we detected weak or negative expression of 
PSCA protein and mRNA (£2 scores) in 7 of 9 (77.8%) 
low grade PIN and in 2 of 1 1 (18.2%) HGPIN, and mod- 
erate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 11 (45.5%) HGPIN. One HGPIN with moderate 
PSCA mRNA expression (6 score) was found weak stain- 
ing for PSCA protein (2 score) by IHC. Strong PSCA pro- 
tein and mRNA expression (9 score) were detected in the 
remaining 3 of 11 (27.3%) HGPIN. There was a statisti- 
cally significant difference of PSCA protein and mRNA 
expression levels observed between HGPIN and BPH (p < 
0.05), but no statistical difference reached between low 
grade PIN and BPH (p > 0.05). 

PSCA expression in Pea 

In order to determine if PSCA protein and mRNA can be 
detected in prostate cancers and if PSCA expression levels 
are increased in malignant compared with benign glands, 
Forty-eight paraffin-embedded Pea specimens were ana- 
lysed by IHC and ISH. It was shown that 19 of 48 (39.6%) 
Pea samples stained very strongly for PSCA protein and 
mRNA with a score of 9 and another 21 (43.8%) speci- 
mens displayed moderate staining with scores of 4-6 (Fig- 
ure 1), In addition, 4 specimens with moderate to strong 
PSCA mRNA expression (scores of 4-9) had weak protein 
staining (a score of 2) by IHC analyses. Overall, Pea 
expressed a significandy higher level of PSCA protein and 
mRNA than any other specimen category in this study (p 
< 0.05, compared with BPH and PIN respectively). The 
result demonstrates that PSCA protein and mRNA are 
overexpressed by a majority of human Pea. 

Correlation of PSCA expression with Gleason score in Pea 
Using the semi-quantitative scoring method as described 
in Materials and Methods, we compared the expression 
level of PSCA protein and mRNA with Gleason grade of 
Pea, as shown in Table 1. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentia- 
tion, 5-7 scores = moderate-differentiation and 8-10 
scores ^ poor-differentiation [7]. Seventy-two percent of 
Gleason scores 8-10 prostate cancers had very strong 
staining of PSCA compared to 21% with Gleason scores 
5-7 and 17% with 2-4 respectively, demonstrating that 
poorly differentiated Pea had significantly stronger 
expression of PSCA protein and mRNA than moderately 
and well differentiated tumors (p < 0.05). As depicted in 
Figure 1, IHC and ISH analyses showed that PSCA protein 
and mRNA expression in several cases of poorly differen- 



tiated Pea were particularly prominent, with more intense 
and uniform staining. The results indicate that PSCA 
expression increases significantly with higher tumor grade 
in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 
With regards to PSCA expression in every stage of Pea, we 
showed the results in Table 2. Seventy-five percent of 
locally advanced and node positive cancers (i.e. C-D 
stages) expressed statistically high levels of PSCA versus 
32.5% that were organ confined (i.e. A-B stages) (p < 
0.05). The data demonstrate that PSCA expression 
increases significantly with advanced tumor stage in 
human Pea. 

Correlation of PSCA expression with androgen- 
independent progression of Pea 

All 9 specimens of androgen-independent prostate can- 
cers stained positive for PSCA protein and mRNA. Eight 
specimens were obtained from patients managed prior to 
androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the 
strongest staining category (score = 9), compared with 
three out of eight (37.5%) of patients with androgen- 
dependent cancers (p < 0.05). The results demonstrate 
that PSCA expression increases significantly with progres- 
sion to androgen-independence of human Pea. 

It is evident from the results above that within a majority 
of human prostate cancers the level of PSCA protein and 
mRNA expression correlates significantly with increasing 
grade, worsening stage and progression to androgen-inde- 
pendence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the 
results of PSCA IHC staining with mRNA ISH analysis. 
Positive staining areas and its intensity and density scores 
evaluated by IHC were identical to those seen by ISH in 79 
of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/ 
48 Pea respectively). Importantly, 27/27 samples with 
PSCA mRNA composite scores of 0-2, 32/36 samples 
with scores of 3-6 and 22/24 samples with a score of 9 
also had PSCA protein expression scores of 0-2, 3-6 and 
9 respectively. However, in 5 samples with PSCA mRNA 
overall scores of 3-6 and in 2 with scores of 9 there were 
less or negative PSCA protein expression (i.e. scores of 0- 
4), suggesting that this may reflect posttranscriptional 
modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data 
demonstrate that the results of PSCA immunostaining 
were consistent with those of mRNA ISH analysis, show- 
ing a high degree of correlation between PSCA protein 
and mRNA expression. 
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Figure I 

Representatives of PSCA IHC and ISH staining in Pea (A. IHC staining, B. ISH staining, *200 magnification). A,, B,: negative con- 
trol of IHC and ISH. PBS replacing the primary antibody (A,) and hybridization with a sense PSCA probe (B,) showed no back- 
ground staining. A 2 . Bj: a moderately differentiated Pea (Gleason score = 3+3 = 6) with moderate staining (composite score = 
6) in all malignant cells; A 2 : IHC shows not only cell surface but also apparent cytoplasmic staining of PSCA protein. A 3 , B 3 : a 
poorly differentiated Pea (Gleason score = 4+4 = 8) with very strong staining (composite score = 9) in all malignant cells. 
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Discussion 

PSCA is homologous to a group of cell surface proteins 
that mark the earliest phase of hematopoietic develop- 
ment. PSCA mRNA expression is prostate-specific in nor- 
mal male tissues and is highly up-regulated in both 
androgen-dependent and-independent Pea xenografts 
(LAPC-4 tumors). We hypothesize that PSCA may play a 
role in Pea tumorigenesis and progression, and may serve 
as a target for Pea diagnosis and treatment. In this study, 
IHC and ISH showed that in general there were weak or 
absent PSCA protein and mRNA expression in BPH and 
low grade PIN tissues. However, PSCA protein and mRNA 
are widely expressed in HGPIN, the putative precursor of 
invasive Pea, suggesting that up-regulation of PSCA is an 
early event in prostate carcinogenesis. Recently, Reiter RE 
etal [1], using ISH analysis, reported that 97 of 118 (82%) 
HGPIN specimens stained strongly positive for PSCA 
mRNA. A very similar finding was seen on mouse PSCA 
(rnPSCA) expression in mouse HGPIN tissues by Tran C. 
P et al [8]. These data suggest that PSCA may be a new 
marker associated with transformation of prostate cells 
and tumorigenesis. 

We observed that PSCA protein and mRNA are highly 
expressed in a large percentage of human prostate cancers, 
including advanced, poorly differentiated, androgen- 
independent and metastatic cases. Fluorescence-activated 
cell sorting and confocal/ immunotluorescent studies 
demonstrated cell surface expression of PSCA protein in 
Pea cells [9). Our IHC expression analysis of PSCA shows 
not only cell surface but also apparent cytoplasmic stain- 
ing of PSCA protein in Pea specimens (Figure 1). One pos- 
sible explanation for this is mat anti-PSCA antibody can 
recognize PSCA peptide precursors that reside in the cyto- 
plasm. Also, it is possible that the positive staining that 
appears in the cytoplasm is actually from the overlying 
cell membrane [5 J. These data seem to indicate that PSCA 
is a novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression 
correlates with high grade (i.e. poor differentiation), 
increased tumor stage and progression to androgen-inde- 
pendence of Pea. These findings support the original IHC 
analyses by Gu 2 et al (9], who reported that PSCA protein 
expressed in 94% of primary Pea and the intensity of 
PSCA protein expression increased with tumor grade, 
stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al 
[10], in which the significant association between high 
PSCA expression and adverse prognostic features such as 
high Gleason score, seminal vesicle invasion and capsular 
involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recur- 
rence, clinical progression or survival of Pea. Hara H et al 
(11] used RT-PCR detection of PSA, PSMA and PSCA in 1 



ml of peripheral blood to evaluate Pea patients with poor 
prognosis. The results showed that among 58 PCa 
patients, each PCR indicated the prognostic value in the 
hierarchy of PSCA>PSA>PSMA RT-PCR, and extraprostatic 
cases with positive PSCA PCR indicated lower disease-pro- 
gression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be used as a prognostic fac- 
tor. Dubey P et al |12] reported that elevated numbers of 
PSCA + cells correlate positively with the onset and devel- 
opment of prostate carcinoma over a long time span in 
the prostates of the TRAMP and PTEN +/- models com- 
pared with its normal prostates. Taken together with our 
present findings, in which PSCA is overexpressed from 
HGPIN to almost frank carcinoma, it is reasonable and 
possible to use increased PSCA expression level or 
increased numbers of PSCA-positive cells in the prostate 
samples as a prognostic marker to predict the potential 
onset of this cancer. These data raise the possibility that 
PSCA may have diagnostic utility or clinical prognostic 
value in human Pea. 

The cause of PSCA overexpression in Pea is not known. 
One possible mechanism is that it may result from PSCA 
gene amplification. In humans, PSCA is located on chro- 
mosome 8q24.2 [1], which is often amplified in meta- 
static and recurrent Pea and considered to indicate a poor 
prognosis [13-15]. Interestingly, PSCA is in close proxim- 
ity to the c-myc oncogene, which is amplified in >20% of 
recurrent and metastatic prostate cancers [16,17]. Reiter 
RE et al 1 18| reported that PSCA and MYC gene copy num- 
bers were co-amplified in 25% of tumors (five out of 
twenty), demonstrating that PSCA overexpression is asso- 
ciated with PSCA and MYC coamplification in Pea. Gu Z 
et al |9] recently reporteted that in 102 specimens availa- 
ble to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had iden- 
tically positive areas of PSCA protein and mRNA expres- 
sion. Taken together with our findings, in which we 
detected moderate to strong expression of PSCA protein 
and mRNA in 34 of 40 (85%) Pea specimens examined 
simultaneously by IHC and ISH analyses, it is demon- 
strated that PSCA protein and mRNA overexpressed in 
human Pea, and that the increased protein level of PSCA 
was resulted from the upregulated transcription of its 
mRNA. 

At present, the regulation mechanisms of human PSCA 
expression and its biological function are yet to be eluci- 
dated. PSCA expression may be regulated by multiple fac- 
tors [18]. WatabeTet al [19] reported that transcriptional 
control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be 
regulated or mediated through cell-cell contact and pro- 
tein kinase C (PKC) [20]. Homoiogues of PSCA have 
diverse activities, and have themselves been involved in 
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carcinogenesis. Signalling through SCA-2 has been dem- 
onstrated to prevent apoptosis in immature thymocytes 
[21]. Thy-1 is involved in T cell activation and transducts 
signals through src-like tyrosine kinases [22], Ly-6 genes 
have been implicated both in tumorigenesis and in cell* 
cell adhesion {23-25). Cell-cell or cell-matrix interaction is 
critical for local tumor growth and spread to distal sites. 
From its restricted expression in basal cells of normal 
prostate and its homology to SCA-2, PSCA may play a role 
in stem/progenitor cell function, such as self-renewal (i.e. 
anti-apoptosis) and/or proliferation [lj. Taken together 
with the results in the present study, we speculate that 
PSCA may play a role in tumorigenesis and clinical pro- 
gression of Pea through afFecting cell transformation and 
proliferation. From our results, it is also suggested that 
PSCA as a new cell surface antigen may have a number of 
potential uses in the diagnosis, therapy and clinical prog- 
nosis of human Pea. PSCA overexpression in prostate 
biopsies could be used to identify patients at high risk to 
develop recurrent or metastatic disease, and to discrimi- 
nate cancers from normal glands in prostatectomy sam- 
ples. Similarly, the detection of PSCA-overexpressing cells 
in bone marrow or peripheral blood may identify and pre- 
dict metastatic progression better than current assays, 
which identify only PSA-positive or PSMA-positive pros- 
tate cells. 

In summary, we have shown in this study that PSCA pro- 
tein and mRNA are maintained in expression from 
HGPIN through all stages of Pea in a majority of cases, 
which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell dif- 
ferentiation), advanced stage and androgen independent 
progression. PSCA protein overexpression is due to the 
upregulation of its mRNA transcription. The results sug- 
gest that PSCA may be a promising molecular marker for 
the clinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of this tumor. 
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Abstract 

Translation Initiation fe regulated In response to 
nutrient availability and mftogenic stimulation and Is 
coupled with ceD cycle progression and ceB growth. 
Several alterations In transnational control occur (n 
cancer. Variant mRNA sequences can after the 
translations! efficiency of Irofividual mRNA molecules* 
which In turn play a role In cancer biology. Changes In 
the expression or availability of components of the 
translational machinery and In the activation of 
translation through signal transduction pathways can 
lead to more globed changes, such as an increase in 
the ovarii rate of protein synthesis and translational 
activation of the mRNA molecules Involved In ceil 
growth ami proliferation. We review the basic 
principles of translational control, the alterations 
encountered In cancer, and selected therapies 
targeting translation Initiation to help elucidate new 
therapeutic avenues. 

Introduction 

The fundamental principle of molecular therapeutics In can- 
cer Is to exploit the differences In gene expression between 
cancer cells and normal cells. With the advent of cDNA array 
technology, most efforts have concentrated on Identifying 
differences In gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences In transcription- Gene expression Is quite complicated, 
however, and Is also regulated at the level of mRNA stability, 
mRNA translation, and protein stability. • 

The power of translational regulation has been best recog- 
nized among developmental biologists, because transcription 
does not occur In early ernbryogenesls In eukaryotes. For ex- 
ample, in Xenopus, the period of transcriptional quiescence 
continues until the embryo reaches midblastuia transition, the 
4000-cell staga Therefore, ail necessary mRNA molecules are 
transcribed during oogenesis end stockpiled m a translaGonaily 
Inactive, masked form. The mRNA are transtatlonally activated 
at appropriate times during oocyte maturation, fertilization, and 



eariy ernbryogenesls and thus, are under strict translational 
control. 

Translation has an established role In cell growth. Basi- 
cally, an Increase in protein synthesis occurs as a conse- 
quence of mltogenesls. Until recently, however, Itttie was 
known about the alterations In mRNA translation In cancer; 
and much la yet to be discovered about their role In tho 
development and progression of cancer. Here we review the 
baste principles of transnational control, the alterations en- 
countered In cancer, and selected therapies targeting transla- 
tion Initiation to elucidate potential new therapeutic avenues. 

Basic Principles of Translational Control 
Mechanism of Translation initiation 
Translation Initiation b the main step In translational regulation. 
Translation initiation is a complex pjocess In which the Initiator 
tRNA and the 40$ and 60S rtbosomaJ suburttserarecruKedto 
the 5* end of a mRNA molecule and assembled by eukaryotic 
translation Initiation tactore into an 80S rfoosome at the start 
cookxntfthemFuNA(r^1).Tr^^ 

capped, Le., contains the cap structure m 7 GpppN ij^m^r^ 
guarrasine-inprrcsp^ Most translation In 

eukaryotes occurs In a cap-dependent fashion, A&, the cap is 
specifically recognized by the elF4E, a which binds the 5' cap. 
The elF4J= translation Initiation complex Is then formed by the 
assembly of elF4E, the RNA heOcase eiF4A, and elF4G, a 
scaffolding protein that mediates the binding of the 40S ribo- 
soma! subunlt to the mRNA molecule through Interaction with 
the elF3 protein present on the 40S ribosoma elF4A and elF4B 
participate in meffing the secondary structure of the 5' UTR of 
the mRNA. The 43S Initiation complex (40S/eir^eMRrW 
QTP complex) scans the mRNA in a 6'-*3 r direction until it 
encounters an AUG start codon. This start codon Is then base- 
paired to the anticodon of Initiator tRNA, forming the 48$ Initi- 
ation complex. The Initiation factors are then displaced from the 
48S complex, and the 60S ribosome joins to form the 80S 
ribosoma 

Unlike most eukaryotic translation, translation Initiation of 
certain mRNAs, such as the ptcomavirus RNA, Is cap inde- 
pendent and occurs by internal ribosome entry. This mecha- 
nism does not require e!F4E Either the 43$ complex can bind 
the initiation codon directly through interaction with the IRES in 
the 5' UTR such as in the encephalomyocanditis virus, or it can 
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J. Translation initiation In eukaryotes. The 4E-BPs are hyperphos- 
PjH>ryla1«dto release etF4£ so that ft can Interact with the 5* cap, and the 
SSL^fS?^ b aS58mbted * Weractton of poly (A) binding 

S? 1 ™?^™ <fiagram. The secondary structure of the 5' UTR is melted, 
the^^ribosomeJ subuntt Is bound td elF3 1 and me ternary complex 
<»n«^ofe!F2,GTP,andtheMtf^ 

£>f«*w i scans tho mHNA In a5'-*3' direction wits an AUQ start codon 
Sii^ * JJ* ^^P*** context The Initiation factors are 

released, and the large ribosomal subuntt is recruited. 



Initially attach to the IRES and then reach the initiation codon by 
scanning or transfer, as Is the case with the poliovirus (1). 

Regulation of Translation Initiation 
Translation Initiation can be regulated by alterations In the 
expression or phosphorylation status of the various factors 
irwpfved. Key components In translations regulation that 
may provide potential therapeutic targets follow. 

ef F4E elF4E plays a central role In translation regulation. 
It Is the least abundant of the initiation factors and Is con- 
sidered the rate-llmfting component for Initiation of cap- 
dependent translation. elF4E may also be Involved In mRNA 
splicing, mRNA 3' processing, and mRNA nucfeocytoplas- 
mlc transport (2). elF4E expression can be Increased at the 
transcriptional level In response to serum or growth factors 
(3), eJF4E overexpression may cause preferential translation 
of mRNAs containing excessive secondary structure In their 
5' UTR that are normally discriminated against by the trans- 



latlonal machinery and thus are inefficiently translated (4-7). 
As examples of this, overexpression of elF4E promotes In- 
creased translation of vascular endothelial growth factor 
flbrdbl^ growth factor-2, andcycBnDI (2,6,9). 

Another mechanism of control Is the regulation of elF4E 
phosphorylation. elF4E phosphorylation Is mediated by the 
mltogen-activated protein Wns^Interacting kinase 1, which 
Is activated by the mltogen-activated pathway activating 
extracellular signal-related kinases and the stress-activated 
pathway acting through p38 mitogert-actlvated protein ki- 
nase (10-13). Several mitogens, such as serum, platelet- 
derived growth factor, epidermal growth factor, Insulin, 
angiotensin II, sre kinase overexpression, and ras over- 
expression, lead to elF4E phosphorylation (14). The phos- 
phorylation status of elF4E Is usually correlated with the 
translations! rate and growth status of the cell; however; 
elF4E phosphorytetion has also been observed In response 
to some cellular stresses when translational rates actually 
decrease (16% Thus, further study Is needed to understand 
the effects of eIF4E phosphorylation on elF4E activity. 

Another mechanism of regulation Is the alteration of elF4£ 
availability by the binding of elF4E to the elF4&blndlng pro- 
teins (4E-BP, also known as PHAS-l). 4E-BPs compete with 
elF4G for a binding site In elF4E. The binding of e)F4E to the 
best characterized elF4E^btadlng protein, 4E-BP1, Is regtH 
lated by 4E-BP1 phosphorylation. Hypophosphorylated 4E- 
BP1 binds to elF4^ whereas 4E-BP1 rryr^erprwsphoryiation 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived growth factor, hepatocyto 
growth factor, nerve growth factor, insullrnllke growth factors 
I and II, InterteuWn 3, ^rahulocyte-macrophage oolony-stiro- 
ulating factor + steel factor, gastrin, and the adenovirus have 
all been reported to induce phosphorylation of 4E-BP1 and 
to decrease the ability of 4E-BP1 to bind eIF4E (15, 10). 
Conversely, deprivation of nutrients or growth factors results 
In 4E-BP1 dephosphorylatlon, an increase in eIF4E binding, 
and a decrease In cap-dependent translation. 

p70 56 Kinase. Phosphorylation of ribosomal 40S protein 
S6 by S6K Is thought to play an Important role In translational 
regulation. S6K mouse embryonic cells proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive Influence on cell proliferation (1 7). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
dlgopyrimkfine tract (5' TOP) found at the 5' OTP of ribosomal 
protein mRNAs and other mRNAs coding for components of 
the translational machinery. Phosphorylation of S6K Is regu- 
lated In part based on the availability of nutrients (18, 19} and is 
stimulated by several growth factors, such as platelet-derived 
growth factor and fnsulMka growth factor I {20). 

elF2<* Phosphorylation. The binding of the Initiator tRNA 
to the small ribosomal unit Is mediated by translation Initia- 
tion factor eIF2. Phosphorylation of the o-subuntt of elF2 
prevents formation of the elR2/QTP/Met-tRNA complex and 
inhibits global protein synthesis (21, 22). elF2a Is phospho- 
rylated under a variety of conditions, such as viral Infection, 
nutrient deprivation, heme deprivation, and apoptosls (22). 
elF2a is phosphoryfated by heme-reguiated Inhibitor, nutrient- 
regulated protein kinase, and the ITO-taduced, double- 
stranded RNA-activated protein kinase (PKR; Ret 23). 



The mTOR Signaling Pathway. The macrollde antibiotic 
rapamydn (SIrallmus; Wyeth-Ayerst Research, Coltegevffle, 
PA) has been the subject of intensive study because It In- 
hibits signal transduction pathways Involved In T-cell activa- 
tion. Tlie rapamycln-sensftfve component of these pathways 
Is mTOR (also called FRAP or RAFT1). mTOR Is the mam- 
malian homologue of the yeast TOR proteins that regulate G, 
progression and translation In response to nutrient availabil- 
ity (24). mTOR Is a serine-threonine kinase that modulates 
translation Initiation by altering the phosphorylation status of 
4E-BP1 and S6K (Fig. 2; Ret 25). 

4&BP1 Is phosphoryfated on multiple residues. mTOR phos- 
phorylates the Thr-37 and "Thr-46 residues of 4E-BP1 in vitro 
(26);hcwever,phosphoryiatto 

with a loss of elF4E binding. Phosphorytation of Tbr-37 and 
Thr-48 is required for subsequent phosphorylation at several 
COOH-termlnal, serum-sensffive sites; a combination of these 
phosphorylation events appears to be needed to Inhibit the 
binding of 4MP1 to e!F4E (25). The product of the/tfMgene,. 
P38/MSK1 pathway, and protein Wnase Co- also play a rote In 
4E-BP1 phosphorytation (27-29). 

S6K and 4E-BP1 are also regulated, In part, by PI3K and Its 
downstream protein kinase Akt PTBM Is a phosphatase that 
negatively regulates PBK signaling. PTEN nuD cells have 
constrtutfvely active of Akt, wfth Increased S6K activity and 
SB phosphorylation (30). S8K activity Is Inhibited both by 
PI3K Inhibitors wortmannln and LY294002 and by mTOR 
Inhibitor rapamydn (24). Akt phosphorylates Ser-2448 In 
mTOR In vitro, and this site Is phosphorylated upon Akt 
activation in vivo (31-33). Thus, mTOR Is regulated by the 
PK3K/Akt pathway; however, this does riot appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
Independent of mTOR Is controversial. 

Interestingly, mTOR autophosphorylation is blocked bywort- 
marmlnbirtnotbyrapamyc^ 

suggests that mTOR-fesponslve regulation of 4E-BP1 and S8K 
activity occurs through a mechanism other than Intrinsic mTOR 
W nase activity. An alternate pathway for 4BBP1 and S8K phos- 
phorylation by mTOR activity b by the Inhibition of a phospha- 
tase. Treatment with calyculin A, an inhibitor of phosphatases 1 
and 2A, reduces rapamycln-lnduced dephosphorylatfon of 4E- 
BP1 and S6K by rapamydn (35). PP2A Interacts with full-length 
S6K but not with a S6K mutant that Is resistant to dephospho- 
rylatlon resulting from rapamydn. mTOR phosphorylates PP2A 
in vitro; however, how this process alters PP2A activity Is not 
known. These results are consistent with the model that phos- 
phorylation of a phosphatase by mTOR prevents dephospho- 
ryiation of 4E-BP1 and S6K, and conversely, that nutrient dep- 
rivation and rapamydn block Inhibition of the phosphatase by 
mTOR. 

Polyadenyfatton. The po!y(A) tall In eukaryotlc mRNA Is 
Important In enhancing translation Initiation and mRNA sta- 
bility. Polyadenyfatton plays a key role In regulating gene 
expression during oogenesis and early embryogenesls. 
Some mRNA that are tranalattonally Inactive In the oocyte are 
polyadenylated concomitantly with translatlonal activation In 
oocyte maturation, whereas other mRNAs that are trartsla- 
tlonally active during oogenesis are deadenytated and trans- 



Growth ftctofs 




(mRNAs with highly structured 5* UTRa) 

Flg.2. Regulation of translation Inftaflon by signed transduction oath- 
way* SJgnatoQ vfa p38. eodraceOutar stgnaJ^ted teW^cTand 
mTOR can aB activate translation fcnftiatfon! 



latlonally silenced (38-38). Thus, control of poly(A) tan syn- 
thesis Is an important regulatory step In gene expression. 
The 5' cap and polyW tall are thought to lunctlon synergls- 
tically to regulate mRNA translatlonal efficiency (39, 40), 

RNA Packaging. Most RNA-blndlng proteins are assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translatlonal fate of the transcript (41). A highly 
conserved family of Y-box proteins Is found In cytoplasmic 
messenger ribonucleoproteln particles, where the proteins 
are thought to play a role In restricting the recruitment of 
mRNA to the translatlonal machinery (41-43). The major 
mRNA-assoclated protein, YB-1 , destabilizes the Interaction 
of elF4E and the 5' mRNA cap fa vitro, artf overexpresslon of 
YB-1 results In translatlonal repression in vivo (44). Thus, 
alterations In RNA packaging can also play an Important role 
In translatlonal regulation. 

Translation Alterations Encountered in Cancer 

Three m^n alterations at the translatlonal level occur In cancer 
variations In mRNA sequences that increase or decrease trans- 
latlonal efficiency, changes In the expression or availability of 
components of the translatlonal machinery, and activation of 
translation through aberrantly activated signal transduction 
pathways. Trie first alteration affects the translation of an Indi- 
vidual mRNA that may play a role In carcinogenesis. The sec- 
ond and third alterations can lead to more global changes, such 
as an Increase In the overaH rate of protein synthesis, and the 
translatlonal activation of several mRNA species. 

Variations In mRNA Sequence 
Variations in mRNA sequence affect the translatlonal effi- 
ciency of the transcript A brief description of these variations 
and examples of each mechanism follow. 

Mutations. Mutations in the mRNA sequence, especially 
In the 5' UTR, can alter Ha translatlonal efficiency, as seen In 
the following examples. 



ouye, SaHo etaL proposed thai translation offuIMength 
&rtyc is repressed, whereas In several BurWtt lymphomas 
that have deletions of the mRNA 5' UTR, translation ctic-myc 
Is more efficient (45) . More recently, It was reported that the 
5' UTR of c-myc contains an IRES, and thus c-myc transla- 
tion can be Initiated by a cap-Independent as well as a 
cap-dependent mechanism (46, 47). In patients with multiple 
myeloma, a C-»T mutation In the c-myc IRES was Identified 
(46) and found to cause an enhanced Initiation of translation 
via Internal ribosomal entry (49). 

8RQA1. A somatic point mutation (117 G-*C) in position 
-3 with respect to the start codon of the BRCA1 gene was 
Wentflted In a highly aggressive sporadic breast cancer (50). 
Chlmertc constructs consisting of the wild-type or mutated 
BRCA1 5' UTR and a downstream tuciferase reporter dem- 
onstrated a decrease In thetranslational efficiency with the 5' 
UTR mutation. 

Cydln-depondent Kinase Inhibitor 2A Some inherited 
melanoma kindreds have a G-»T transverslon at base -34 
of cycfin-dependent kinase inhibitor-2A, which encodes a 
cycfliKiependent kinase 4/cydIn-dependent kinase 6 kinase 
Inhibitor Important In G, checkpoint regulation (51). This 
mutatk>n t .glves rise to a novel AUG translation Initiation 
codon, creating an upstream open reading frame that com- 
petes for scanning ribosomes and decreases translation 
from the wild-type Aua 

Alternate SpDdng and Attentate Transcription Start 
Sites. Alterations In splicing and alternate transcription sites 
; can lead to variations in 5' LTO sequence, lengft, and second- 
ary stmcture, ultimately impacting translations! efficiency. 

ATM. The ATM gene has four noncoding exons in Its 5' 
UTR that undergo extensive alternative spBcing (52). The 
contents of 12 different 5' UTRs that show considerable 
diversity In length and sequence have been identified. These 
divergent 5' leader sequences play an important role In the 
translation^ regulation of the ATM gene. 

mdrn. In a subset of tumors, overexpresslon of the onco- 
proteln mdm2 results In enhanced translation of the mdm2 
mRNA. Use of different promoters leads to two mdm2 tran- 
scripts that differ only In their 5' leaders (53). The longer 5 r 
UTR contains two upstream open reading frames, and this 
mRNA is loaded with ribosomes Inefficiently compared with 
the short 5' UTR. 

BRCA1. In a normal mammary gland, BRCA1 mRNA Is 
expressed with a shorter leader sequence (5'UTRa), whereas 
In sporadic breast cancer tissue, BRCA1 mRNA Is expressed 
with a longer leader sequence (5 r UTRb); the transiational 
efficiency of transcripts containing 5' UTRb is 10 times lower 
than that of transcripts containing 5' UTRa (54). 

TGF-fiS. TGF-p3 mRNA includes a 1,1-kb 5' UTR, which 
exerts an Inhibitory effect on translatloa Many human breast 
cancer cell fines contain a novel TOF-03 transcript with a 5' 
UTR that Is 870 nucleotides shorter and has a 7-fold greater 
transiational efficiency than the normal TCF-/33 mRNA (55), 

Alternate Polyadenylation Sites. Multiple polyadenyl- 
atlon signals leading to the generation of several transcripts 
with differing 3' UTR have been described for several mRNA 
species, such as the RET proto-oncogene (56), ATM gene 
(52), tissue Inhibitor gf metalloprotelnases-3 057), RHOA 



proto-oncogene 058), and calmodulin-! (59). Although th© 
effect of these alternate 3' UTRs on translation Is not y&t 
known, they may be Important In RNA-proteln interactions 
that affect translattdnal recruitment The role of these alter- 
ations in cancer development and progression Is unknown. 



Alterations In tho Component* of the 
Translation Machinery 

Alterations In the components of translation machinery can 
take many forms. 

Overexpressslon of elF4EL Overexpresslon of elF4E 
causes malignant transformation in rodent ceils (60) and the 
deregulation of HeLa cell growth (81). Potunovsky etal. (62) 
found that eiF4E overexpresslon substitutes for serum and 
Individual growth factors In preserving viability of fibroblasts, 
which suggests that elF4E can mediate both proliferative and 
survival signaling. 

Elevated levels of eIF4E mRNA have been found in a broad 
spectrum of transformed eel lines (63). elF4£ levels am 
elevated In al I ductal carcinoma In situ specimens and Inva- 
sive ductal carcinomas, compared with benign breast spec- 
imens evaluated with Western blot analysis (64, 65). Prelim- 
inary studies suggest that this overexpresslon Is attributable 
to gene amplification (66). 

There are accumulating data suggesting that efF4E overex- 
presslon can be valuable as a prognostic marker. e!F4E over- 
expression was found in a retn*pective studytobeamaikerof 
poor prognosis In stages I to in breast carcinoma (67). Verifica- 
tion of the prognostic value of elF4E in bn?ast cancer Is now 
under way In a prospective trial (67). However, In a different 
study, elF4E expression was correlated with the aggressive 
behaviorof non-HodgWn's lymphomas (68). in a prospective 
arialysis of patients with head ^ 

of eiF4E In htetotogfcafly tumor-free surgical margins predicted 
a significantly increased risk of local-regional recurrence (9). 
These results all suggest that eIF4E overexpresslon can be 
used to select patients who might benefit from more aggressive 
systemte therapy. r\irtriermore 

suggest that elF4E overexpresslon is a field defect and can be 
used to guide local therapy. 

Alterations in Other Initiation Factors. Alterations In a 
number of other initiation factors have been associated with 
cancer. Overproduction of elF4Q, similar to elF4E, leads to 
malignant transformation In vflro (69). eIF~2a Is found In 
increased levels In bronchloloalveolar carcinomas of the lung 
(3). Initiation factor eIF-4A1 Is overexpressed In melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunlt of 
translation initiation factor 3 is amplified and overexpressed 
In breast and prostate cancer (72), and the elF3-p1 10 subunit 
is overexpressed In testicular seminoma (73). The role that 
overexpresslon of these initiation factors plays on the devel- 
opment and progression of cancer, if any, Is not known. 

Overexpresslon of S6K. S6K Is amplified and highly 
overexpressed In the MCF7 breast cancer cell line, com- 
pared with normal mammary epithelium (74). In a study by 
Bariund ef af. (74), S6K was amplified in 59 of 668 primary 
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 
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Overexpresslon of PAP. PAP catalyzes 3' poty(A) syn- 
desis. PAP teoverexpressed In human cancer cells com- 
pared with nonmal and vtraily transformed cells (75). PAP 
eittymatfc activity In breast tumors has been correlated wfth 
PAP protein levels (76) and, In mammary tumor cytosols, was 
found to be an Independent factor for predicting survival (76). 
little Is known, however, about how PAP expression or ac- 
tivity affects the translations! profile. 

Alterations In RNA-bindlng Proteins. Even less b known 
about alterations In RNA packaging In cancer. Increased ex- 
pression end nuclear localization of the RNA-Wndlng protein 
YB-1amkKfic»torsofapoorprog^ 
non-smaB cefl &mg cancer (Tfy and ovarian carK^(7fy How- 
ever, this effect may be mediated at least In part at the level of 
transcription, because YB-1 hcreasescherrK>re^ 
handng the transcription of a multidrug resistance gene (80). 



Activation of Signal Transduction Pathways 
Activation of signal transduction pathways by loss of tumor 
suppressor ger^ or overaxpression 
can contribute to the growth and aggressrveness of tumors. An 
Important mutant In human cancels Is the tumor suppressor 
gene PTOV, which leads to the activaHon of the PBK/Akt path- 
way. Activation of PBK and AW^ 
fonnattonofcWckenembr^ 
show constitutive ptawphorylatfonof S6Kandof 4^ 
A mutant Akt that retains kinase activity but does not phos- 
phorytedeS6Kor4E-BPt does not transform fibrobl^ 
suggests a correlation between the oncogenicity of PJ3K and 
Akt and the phosphorylation of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, fnsulMke growth factor, HER2toeu, and epidermal 
growth factor receptor are overexpressed In cancer. Be- 
cause these Mnasea activate downstream signal transduc- 
tion pathways known to alter translation initiation, activation 
of translation is likely to contribute to the growth and aggres- 
siveness of these tumors. Furthermore, the mRNA for many 
of these kinases themselves are under translattonal control. 
For example, HER2/neu mRNA Is translatlonally controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation In a cell type-Independent manner and 
by a distinct cell type-dependent mechanism that increases 
translattonal efficiency (82). HER2/neu translation Is different 
in transformed and normal cells. Thus, it Is possible that 
alterations at the translallonal level can In part account for 
the discrepancy between HEft2/neu gene amplification de- 
tected by fluorescence In situ hybridization and protein levels 
detected by ImrnunohlstochemlcaJ assays. 



Translation Targets of Selected Cancer Therapy 
Components of the translation machinery and signal path- 
ways Involved In the activation of translation Inflation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycln 
and Tumstatfri 

Rapamycln Inhibits the proliferation of lymphocytes. It was 
Initially developed as an Immunosuppressive drug for organ 



transplantation. Rapamycln with FKBP 12 (FRSOd-bkidtng 
protein, M r 12,000) binds to mTOR to inhibit As function. 

Rapamycln causes a small but significant reduction In the 
initiation rate of protein synthesis (83). It blocks cell growth I n 
part by blocking S8 phosphorylation and selectively su im- 
pressing the translation of 5'TOP mRNAs, such as ribosomal 
proteins, and elongation factors (83-85). Rapamycln also 
blocks 4E-BP1 r^sphorylatfon and inhibits cap^lepehdent 
but not cap-independent translation (1 7, 80). 

The rapamydn-sensttive signal transduction pathway, acti- 
vated oaring maDgr^ transfix 
IsrwwrjelngsturiTedasatar^ 

tats, breast, smafi cefl lung, gflobJastoma, melanoma, and Tcea 
leukemia are among the cancer fines most sensitive to the 
rapamycln analogue CO-779 (VVyetrhAyerst Research; Ref. 
87). InrhaJxtomycosaixxrwceaBne^ rapsuT^cin Is either cyto- 
static or cytocidai, deperKiing on tter^ status of the cell; p53 
wDcHype ceils treated wfm raparr^ arrest In the G, phase 
and maintain thdr vtabtlity, whereas p53 mutant cefe accumu- 
late In G, and undergo apoptosfs (88,88), In a recent^ reported 
study using human primitive neuroectodermal tumor and 
medulloblastoma models, rapamycln exhibited more cytotox- 
ictty In combination wfth dsptaiin and camptothedn than as a 
single agent In Ww>, CCI-779 delayed growth of xenografts by 
160% after 1 week of therapy and 240% after2 weeks. A single 
high-dose administration caused a 37% decrease In tumor 
volume. Growth inhibition h vfVo was 1.3 times greater, wfth 
clsplatin In combination with CCI-779 than with cisplatfn atone 

(90) . Thus, preclinical studies suggest that rapamycln ana- 
logues are useful as single agents and In combination with 
chemotherapy. 

Rapamycln analogues CCI-779 and RAD001 (Novartls, 
Basel, Switzerland) are now In clinical trials. Because of the 
known effect of rapamycln on lymphocyte proliferation, a 
potential problem with rapamycln analogues is Immunosup- 
pression. However, although prolonged immunosuppression 
can result from rapamycln and CCI-779 administered on 
continuous-oose schedules, the immunosuppressive effects 
of rapamycln analogues resolve In ^24 h after therapy 

(91) . The principal toxicities of CCI-779 have Included der- 
matological toxicity, myetosuppression, infection, mucositis, 
diarrhea, reversible elevations In liver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase II trials of CCI-779 have been conducted in 
advanced renal cell carcinoma and In stage HI/TV breast 
carcinoma patients who felted with prior chemotherapy, in 
the results reported in abstract form, although there were no 
complete responses, partial responses were documented In 
both renal cell carcinoma and in breast carcinoma (94, 95). 
Thus, CCi-779 has documented preliminary clinical activity In 
a previously treated, unsetected patient population. 

Active investigation Is under way into patient selection for 
mTOR Inhibitors. Several studies have found an enhanced 
efficacy of CCI-779 in PTEN-null tumors (30, 96). Another 
study found that six of eight breast cancer ceil lines were 
responsive to CCI-779, although only two of these lines 
lacked PTEN (97) There was, however, a positive correlation 
between Akt activation and CCI-779 sensitivity (97). TWs 
correlation suggests that activation of the PI3K-Akt pathway, 
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regardless of whether It Is attributable to a PTEN mutation or 
to ovenexpresslori of receptor tyrosine kinases, makes can- 
cer cell amenable to mTOR-dtacted therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with rapamycto resistance; thus, a lower 4E-BP1/elF4E ratio 
may predict rapamycfn resistance (98). 

Another mode of activity for rapamycln and its analogues 
appears to be through Inhibition of anglogenesls. This activ- 
ity may be both through direct inhibition of endothelial cell 
proliferation as a result of mTOR Inhibition hi these cells or by 
Inhibition of translation of such proanglogenlc factors as 
vascular endothelial growth factor In timior cells ^9, 10P). 

The angiogenesis Inhibitor tumstatin, another anticancer 
drug currently under study, was also found recently to Inhibit 
translation in endothelial cells (101). Through a requisite in- 
teraction with fntegrfn, tumstatin Inhibits activation of the 
PJ3K/Akt pathway and mTOR in endothelial cells and pre- 
vents dissociation of elF4E from 4E-BP1, thereby Inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOFfcsignalfng pathway. 

TargetMg elteo: EPA, Clotrimazole, mda-7, 
andRavonoids 

EPAIsann-3 polyunsaturated fatty acid found In the fish- 
based diets of populations having a low Incidence of cancer 
(102}. EPA Inhibits the proliferation of cancer cells (103), as 
weq as In animal models (104, 1 05). it blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca 2+ from 
Intracellular stores while inhibiting their refilling, thereby ac- 
tivating PKR PKR, in turn phosphorylates and inhibits elF2a, 
resulting In the inhibition of protein synthesis at the level of 
translation Initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent In vitro and In Ww>, inhibits cell growth through 
depletion of Ca 2 - stores, activation of PKR, and phospho- 
rylation of e!F2a (106). Consequently, clotrimazole preferen- 
tially decreases the expression of cycllns A, E, and Dl, 
resulting In blockage of the cell cycle In Q v 

mda-7 is a novel tumor suppressor gene being developed 
as a gene therapy agent Adenoviral transfer of mda-7 (Ad- 
mda7) Induces apoptosis In many cancer cells Including 
breast, colorectal, and lung cancer (107-109). Ad-mda7 also 
Induces and activates PKR, which leads to phosphorylation 
of efF2a and induction of apoptosis (110). 

Flavonoids such as genlstein and quercetin suppress tu- 
mor cefl growth. All three mammalian elF2a kinases, PKR, 
heme-regulated Inhibitor, and PERK/PEK, are activated by 
flavonoids, with phosphorylation of elF2a and inhibition of 
protein synthesis (111), 

Targeting efF4A and elF4& Antisense RNA 
and Peptides 

Antfesense expression of elF4A decreases the proliferation rate 
of melanoma ceils 01?). Seo^estratlonofelF4Eb 
sfen of 4E-BP1 is proapoptotfc and decreases tumorfgenlclty 
(113, 114). Reduction of elF4E with antisense RNA decreases 
soft agar growth, Increases tumor latency, and Increases the 
nates of tumor doubling times (7). Antisense eiF4E RNA treat- 



ment also reduces the expression of autogenic fectors (11^ 
and has been proposed as a potential acyuvant therapy for heact 
and neck cancera,partiaflarryvi^ i n 
surgical marfifns. Small molecule InWbrfe^ 
4&BP1-blnding domain of e(F4E are proapoptotfc (116) and 
are also being actively pursued. 

Exploiting Selective Translation for Gene Therapy 
A different therapeutic approach that takes advantage of tho 
enhanced cap-dependent translation In cancer cells Is the use 
of gene therapy vectors encoding suicide genes with highly 
structured 5* UTR These mRNAwoukltiius be at a competitive 
disadvantage In normal cells and not translate well, whereas In 
cancer c^ they woiW translate mro 
the introduction of tte^ 

the coding sequence of herpes simplex vims type-1 1hymktin& 
kinase gene, aBows for selective imr^on of herpes simplex 
vfrus r>pe-7 thymidine kinase gene In breast cancer cefl lines 
compared with normal mammary ceB lines and results In se- 
lective sensitivity to ganctatovfr (1 1 7). 

Toward the Future 

Translation b a crucial process in every celL However, several 
alterations In transtatio 

appear to need an aberrantly activated translatjonal state for 
survival, thus allowing the targeting translation 
surprisingly low toxicity. Components of the translations! ma- 
chlnery, such as elF4E, and signal transduction pathways in- 
volved in translation Initiation, such mTOR, represent promising 
targets for cancer therapy, inhibitors of the mTOR have already 
shown some preliminary activity in clinical trials, it is possible 
that with the development of better predictive markers and 



can be Improved. Similar to other cytostatic agents, however, 
mTOR Inhibitors are most Beefy to achieve conical utJBy in 
combination therapy. In the interim, our Increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to lead to the identification of new therapeutic targets 
in the near future. 



Acknowledgments 

We thank Gayfe Nesom from In© University of Texas M, D. Anderson 
Cancer Center Department of Scientific Publications for editorial assist- 
ance and Dr. Elmer Bernstam for assistance with manuscript preparation. 

References 

1. Pestova, T. V., Kolupaeva, V. G., LomaWn, L B„ PHfpenko, E V., Shatsky, 

1. M. Agol, V. I., and Hetten, C. U. Molecular mechanisms of translation 
tnftfatton In eekaryotes. Proa NaU Acad. Set USA* 98: 7029-J036, 2001. 

2. Rosenwafd, !. B., Kaspar, R.. Rousseau, D„ Gehrke, U Lebgufch, p^ 
Chen, 4. J., Schmidt E v., Sonenberg, ri, and London, L M. Eukaryotfc 
translation Initiation factor 4E regulates expression of cycJJn D1 at tran- 
scriptional and post-transcrfptlonal tevels. J. Biol. Cham., 270: 21176- 
21180, 1995. 

3. RosenwaJd, I. B„ Hutder, M. J., Wang, S., Savas, L, and Fraire, A. E 
Expression of eukaryotfc translation Initiation fedora 4E and 2a Is In- 
creased frequently In bronchloIoaJveoIar but not In squamous cell carci- 
nomas of the lung. Cancer (PhBa.), 92: 2164-2171, 2001. 



f O*^. A.,Pe!letier, and Sonenberg, N. Differential efflcfencfea of 

«^1nmslatkmofrnousec-nf^ 

fated region. Proo. Nati. Acad. Sci USA, 52: 2315-5319, 1985. 

& Kczak, M. Influences" of mRNA secondary structure on Initiation by 

eukaryotfc rfoosomes. Proc Natl Acad. 8d USA, 83: 2850-2854, 1988. 

a Kixomilas, A. E., U^rte-Karatzas, A., and Scnenberg, N. mRNAs 

containing extensive secondary stiucture tn their 5' non-coding region 

translate efficiently in ceifa overexpressing initiation factor el WE EMBO 

J„f 1; 4163-4158, 1992. 

7. Wnher-Schaeff^ 

Wk^^ R..E Decreasing the level of translation initiation factor 4E w&h 
snfc^RNA causes nwer^ 

genssbof doned rat embryo fibroblast* Int J. Cancer, 55: 841-847, 1993. 
a. Kevi, C.O,De Benedetti. A, Payne, a rC. Coe, L L, Laroux, F. 3 n 
and Alexander, J. a. Transitional regulation of vascular parmeabfifty 
factor by eukaryotic fnfflaflon factor 4E: implications for tumor angjogsn- 
esls. bit J. Cancer, 65: 785-790, 1996. 

ft. Nathan, C. A, RanHin, a, Abreo, F. W.. Nassar, a. Oe Benedetti. A., 
and Glass, J. Analysis of surgical margins whn the molecular marker 
eiF4E: a prognostic factor mp*^ 
Oncol, 17; 2909-2914, 1999. 

10. Fufomaga, R., ami Hunter, T. MNK1, a new MAP kinase-actfvated 
protein kinase, Isolated by a novel expression screening method for 
Identifying protein kinase substrates. EMBO J., J6V 1921-1933, 1997. 

11. Wasktewfcz, A. X, Rynn, A.. Proud, C. G., and Cooper, J. A. Mitogen- 
activated protein kinases activate the senne/threonlne kinases Mnkl and 
Mnk2. EMBO J, 16: 1909-1920, 1997. 

12. Wang, X, Rynn, A., Wasktevvtaz, A J„ Webb, B. U Vrfes, Ft a 
BeJnes, t A., Cooper. J. A, and Proud, a a The phosphorylation of 
euteryotto tarnation factor elF4E In response to phorbol esters, cell 
stresses, and cytokines Is mediated by distinct MAP kinase pathways. 
J. Bid. Chem., 273: 9373-9377, 1996. 

13. Pyronnet, Irnataka, H., GIngras, A C. Fukunaga, a, Hunter, T, 
and Sonenberg, N. Human eukaryotlc translation Initiation factor 4G 
(elr^iecnto Mnkl to phosphorylateelR&EMBOJ. ra:270-e7ai999. 

14. Wefln, M„ Scheper, G. a, Voorma, H. 0. emf Thomas, A A Regulation 
translation bitiationfac^ 

531-544, 199$. 

15. Raught, B., and GIngras, A C. elF4E activity Is regulated at multiple 
levels. Int J. Blochem. CeR BtoL, 37: 43-57, 1 999. 

16. Tateuchl, It, Shlbamoto, S., Nanamine. »C Shlgemort, L, Omura, S„ 
Kftamura, N., and ho, F. Signaling pathways leaning to transcription and 
translation cooperatively regulate the transient Increase m expression of 
o-Fos protem. J. Blot Chem.. 276V 28077-26083, 2001 . 

17. Kawesome. H., Papst, P., Webb, S;, Keller, CL M., Johnson, G. L r 
GeHand, E. W., and Terada. N. Targeted disruption of p70(e6k) defines Its 
role In protein synthesis and raparnycln sensitivity. Proc. NatL Acad. Scl 
USA, 95r 6033-5038, 1998. 

18. Christie, a R., Hajduch, E., Hundal, H. S., Proud, C. a, and Taylor, 
P. M. Intracellular sensing of amino acids in Xenopus taevts oocytes 
stimulates p70 S6 kinase In a target of rapamycln-dependent manner, 
d. Biol. Chem., 277: 9952-9957, 2002. 

19. Ham. K., Ycxiezawa, IC, Wang, Q. P., KoztowsW, M. T„ Betham, a, and 
Avrucfc J. Ammo acid sufficiency and mTOR regulate p70 S6 Mnase and 
eKME BP1 through a common effector mechanism. J. BfoL Ghent, 273; 
14484-14494,1998. 

20. Graves, L M., Bomfeldt, K. E.. Argast, a M., Krebs. E. a, Kong, XI, 
Un, T. A, and Lawrence, J. C„ Jr. cAMP- and rapamycln-eensmVe reg- 
ulation of the association of eukaryotlc initiation factor 4E and the trana- 
latlona! regulator PHAS-I in aoruo smooth muscle cells, Proo. Natl Acad. 
Sd USA 9Z 7222-7226, 1995. 

21. Men**W.C.,8JKlHeishey i j.W.B.Tta 

eukwyotic protein synthesis. /*• J. W. B. Hershey and M. a Mathews 
(eds.), Transbtional Control, pp. 31-69. Cold Spring Harbor, NY: Cold 
Spring Harbor Uboratory, 1996. 

22. Wmbal, S. a EukaryotJo Initiation factor eIF2. Int J. Blochem. Cell 
Blot, 3f: 25-29, 1999. 

2?' Jagus. R, Joshl, B„ and Barber, G. N. PKR, apoptosb and cancer. 
Int J. Blochem. CeD Biol., 31: 123-138. 1999. 



24. Thomas, a, and Haft M.N. TOR slo^alHngajKi control of ceBmwthi 
Curr. Opm. Cell Biol, 9: 782-787, 1997. 

25. GbgT83,AC. l Raiigm;B^8n^ Regulation of transla- 
tion Initiation by FRAR/mTOR. Genes Dev., 16V 807-82?, 2001. 

26. Ghgjas, A C^ Gygl, S. P., RaugM, B., PoteWewks; a D„ Abraham 
a T^ Hoekstra, M. F., AebereoM, R, and Sonenberg, N. Regulation of 
4E-BP1 phosphorylation: a novel two-step mechanism. Genes Dev., 73 > 
1422-1437,1999. 

27. Kumar, v., Pandey. P. SabatM. O, Kumar, Majumder, P. le, 
Bharti I A,Camik^G M Kufa,D^aixlK^^ 

between RAFTiyFRARmiTOT and protein Wnase C8h the regulation of 
caj><lependent Initiation of translation. EMBO J„ 19: 1087-1097, 2000. 

28. Yang, D. Q., and Kastan, M. a PanJcipation of ATM In rnsuUn signal- 
ling through phosphorylation of e1F^4E-bmdlng protem 1. Nat Celt Biol.. 
2:893-898,2000. 

29. Uu. O, Zhang, Y^ Bode, A M., Ma, W. Y. and Dong, Z. Ptiospho- 
ryfation of 4E-BP1 Is mediated by the D38/MSK1 pathway m response to 
UVB Eradiation. J. Blot Chem., 277; 8810-8816, 2002. 

SO. Neshat. M. S.. MeUinghoff, I. K.. Tran. a. Stiles, B^ Thomas, Q., 
Petersen, a. Frost a, GftJbons, J. J, WU. H., and Sawyers, C.LErv- 
hanced sensitivity of PTENstefldent tumors to Inhibition of FRAP/mTOF^ 
Proc Matt Acad Sol USA, 98: 10314-10319, 2001. 

31. Sekuflc, A, Hudson, C. Homme, J. L, Yin, P, Ottemess. D. M., 
Kamta, U M^ and Abraham, a T. A direct linkage between the phosphol- 
nosmde 3-Hnaso-AKr slgnaing pathway and me rnaremaDantarp^ 

mydn in m^n^timulated and transfbnnedceas. Cancer Res^ 60: 3504- 
3513,2000. 

32. Scott, P. H., and Lawrence, J. C^ Jr. Attenuation of mammalian target 
of rapaniycm activity by Increased cAMP In 3T3-i1 adipocytes. J. Blot 
Chem., 273: 34496^34501, 1998. 

33. Reynolds, t T., Bodine, a C u and Lawrence, J. C. f Jr. Control ©>f 
Ser2448 phosphorylation in the mammalian target of rapamycm by Insulin 
and skeletal muscle load. J. BtoL Chem., 277; 17657-17662, 2002. 

34. Peterson, a TJ, Beat P. A, Comb, M. J., and Schreiber, S. U 
FKBP12^apamycln-assodated protein (FRAP) autophosphorytates at 
serine 2481 under transtattonafly repressive conditions. J. Biol. Chem.. 
275:7416-7423,2000. 

35. Peterson, a T„Desal, a N., Hardwlck, J.S.. and Schreiber, S. L. 
Protem phosphatase 2A Interacts wfm the 70-kDaS8Wmise and Is acti- 
vated by mhlbitfon of FKBP12-rapamycm-assocstarted protem. Proc Natf. 
Acad. Sd. USA, 96: 4438-4442, 1999. 

36. McGrew, L. U DworWn-Rastl, DworWn, M. B„ and Rlchter, J. D. 
Poly(A) elongation during Xenopus oocyte maturation is required for trans- 
lattonal recruhrnent and Is mediated by a short se<Juonce etemern\ Genes 
Dev^ a* 803-81 5, 1989. 

37. Sheets, M. D., Wu, M„ and Wtekens, M. Polyadenyiation of c-mos 
mRNA as a control point In Xenopus meiotic maturation. Nature (bond ) 
374:511-616.1995. * 

38. Varnum. S. M., and Wormlngton, W. M. Deadenylatlon of maternal 
mRNAs during Xenopus oocyte maturation does not require specmc 
cfe-sequences: a default mechanism for translatlonal control. Genes Dev 
4:2278-2286,1890. 

39. Gallie, D. R. The cap and potv(A) tall function synerglstlcaDy to regu- 
late mRNA translational efficiency. Genes Dev^ 6V 2108-2118, 1991. 

40. Sach3, AB.,andVar8J^aEuitaryotiotranala^n iiTWatton: there are 
(at least) two sides to every story. Nat Street. Biol., 7; 358-361, 2000. 

41. WqHfe, A and Merle, F. CoupBng transcription to translation: a 
novel site for the regulation of eukaryotlc gene expression. Int J. Bio- 
chem. Cell Blot., 28: 247-257, 1999. 

42. BrfoHmova, V. M., Wet a U, Sctikov, A Slmonenko, P. N, 
Lazarev, O. A, Vasllenko, K. &, Ustinov, V. A. Hershey, J. W„ and 
Ovchlnntkov, L P. The major protein of messenger ribonucteoprotofc 
particles m somatio oeDs b a member of the Y-box binding transcription 
factor family. J. BtoL Chem, 270: 3166-3192, 1995. 

43. Matsumoto, K., Menc, F., and Wolffe, A P. Translatlonal repression 
dependent on the mteraction of the Xenopus Y-box protein FRGY2 with 
mRNA Role of the cold shock domain, tail domain, and selective RNa 
sequence recognition. J. Blot Chem,, 271: 22708-22712, 1896. 



979 Tronstation Initiation In Cancer 



44. Evclokimova, V n Ruzanov, P.. fmataka, K, Raught. B., Svitklrt, Y.. 
OvcWnrfoov, L P., and Sonenberg, M The major mRNA-assoclaied pro- 
tein YB-1 fe a potent 6' cap-dependent mRNA stabilizer. EMBO J., 20: 
5491-6502,2001. 

45. Saito, H,, Heyday* A. C, WZman, K., Hayward, W. a, and Tonegawa, 
S k ActtaDcnoftf«c-/m/cgera 

control Proa Natl. Acad Set USA, 80: 7476-7480. 1 98a 
4* Nanbru, C lafon, L, Audgfer, a, Gensac, M. a, Vagner, 8., Huez, G., 
and Prats, A. C. Alternative translation of the rjoto<>ncogerte c-rm* by an 
tntamai rtocsorne entry sita J. Bid ChenL, 272: 32061-32066, 1 997. 
4T. Stonetey, M„ PauBn. F. E, Le Quesne, J. P n ChappaB, a A, and 
WBlIs, A E c-Myc 5' untranslated region contains an Internal rfoosome 
entry segment Oncogene, l& 423-420, 1998. 

46. PawOn, F. E, West, M. J„ SuBvan. N. F„ Whitney. Ft L f tyne, L, and 
Wfflte, A E Aberrant transjatfonat control of the o-myc gene In multiple 
myeloma. Oncoeene. 13: 505-513, 1995. 

49. Chapped, & A^ LeQueane, 4. P., Paulin, F. E, de Schooimeester, 
M. L. Stonetey. M„ Soutar, R. |_, Ralston, a H, HeKrlch, M H., and WHO*, 
A E A mutation to the o-myc-fRE3 leads to enhai^ Internal r&osome 
omry In muftpia myotome: a novel mechanism of oncogene do-regulation. 
Oncogene, 19: 4437-4440, 2000. 

50. SJgnort, E, Bagnl, C., Papa, S., Prfmereno, a, Rfnakfi, M, Amakil, F., 
and Fazio. V. M A. A somatic mutation tn the 5*UTR of BRCA1 gene In 
sporadte breast cancer causes down-modulatton of translation efficiency. 
Oncogene, 20: 4596-4600, 2001. 

51. Uu.L,Dawcfth,a. Gao.L, Monzon, J., Summers, A, Lassam, N., and 
Hogg, IX Mutation of the CDKN2A 5' UTR creates en aberrant Initiation 
codon and pndfeposesto meiancma. Nat Genet. 2*: 128-132, 1999. 

62. Savftsky, rC, Platter, Uzfel.T., Quad, a, SartietA^ Rosenthal A-. 
Broy-Stem, 0„ ShBoh, Y., end Rotman, Q. Ataxfartefengfeotasia: struc- 
tural diversity of untranslated sequences suggests complex post-tran- 
scrlpfonal regulation of ATM gene expression. Nudete Acids Res., 25: 
1678-1684, 1997. 

63. Brown, C. Y., MIze, G. J., Pineda, M, George, D. L, and Moms, DJ R. 
Role of two upstream open reading frames in the translation^ control of 
oncogene mdmZ, Oncogene. 78: 6631-6637, 1999. 

54. Sobczak, IC, end Krzyzosiak, W. J. Structural determinants of BRCA1 
translatfonal regulation. J. Biol. Chenu, 277:- 17349-17358, 2002. 
55* Arrlck, 8. A., Grendell, R. L, and Grrffrn, L A. Enhanced transtattonai 
efficiency of a novel transforming growth factor 63 mRNA m human breast 
cancer cells. Mot. CeB. Blot, 14: 61 9-628, 1994. 
58. Myers, a M., Eng, C., Ponder, B. A^ and Mulligan, L M, Character- 
ization of RET proto-oncogene 3' splicing vartants and poryadenylatton 
sites: a novel C-terrnfnus tor RET. Oncogene, 11: 2039-2045, 1995. 

57. Byrne, J. A., Tomasetto, C., Rouyer, N., BeOocq. J. P., Rio. M. <X, end 
Basset, P. The tissue Inhibitor of metaItopmtefnases-3 gene In breast 
carcinoma- Wentfflcation of multiple poryadenylatlon sites and a stromal 
pattern of expression. Mot Med., 1: 418-427, 1995. 

58. Moscow, J. A., He, R. t Gudas, J. M., and Cowan, K. H- Utilization of 
muhlple poryadenylatlon signals in the human RHOA protooncogene. 
GeneiAmst.), 144: 229-236, 1994. 

59. SenterreHjesentants, a, Alag, A. S, and Sobel, M. E Multiple mRNA 
species are generated by alternate 

oduPn-4 gene. J. CeiL Blochem., 58: 445-454, 1995. 

60. Lazaris-Karatzas, A., Montine, K. a, and Sonenberg, N. Malignant 
trajwtermation by a eukaryottelnftration factor subunft tlwt Wn* mRNA 
5' cap. Nature (Lend.), 345.- 544-647, 1990. 

61. DeBerarietf,A.,arwlRfioads f REC 
teinsymtesfe Inflate 

morphok>®r. Proa. NatL Acad. Sd USA, 87: 8212-8216, 1990. 

62. Potunoveky, V. A.. RosenwaW. I B„Tan, A.T., White, J„ Chiang, L, 
Sonenberg, N., and Bftterrnan, P. B. TrenslatJonaJ control of programmed 
cell death: eukaryotto translation Initiation factor 4E blocks apoptosfe In 
B^wth-factor-restrtcted fibroblasts with phystologtcalry expressed or de- 
regulated Myo. Mol. CeB. Biol., T6; 6573-6581, 1996. 

63. Mfyagt Suglyama, A., Asat An OkazaW, KucWno, Y. and Kerr, 
S,J. Bevatedleveb of eukaryotto translation Woatlon factor elF-4E. mRNA in 
a broad spectrum of trensforr^ 



64. Kerekatte, V M Smiley, IC, Hu, B., Smith, A, Gelder, F^ and Do 

Benedetto A. The protc-oncoger^ a survey of Its 

expression In breast cardromas. Int J. Cancer, 64: 27-31, 1^5. 

66. U B. D., Uu, L, Dawson, M., and De Benodottl, A Overexprossion erf 

eukoryotfc Inroad fectw 

73:2385-2390.1897. 

66. Sorre8a,D.L, Black, D.R^Meschor^C.Rrwao^R.DeBer^o^A.. ' 
Gao, M^ Wimams, a and U, & D. Detection of g3F4B gene ampifficatfori 
In breast cancer by compeu^ve PCR Arm. Surg. OncoL, 5c 232-237, 199S_ 

67. U.B.D., McDonald, J. C„ Nasser, R^ and De Benedetti, A. Cnnteal 
outcome In stage I to Id brea3t cardnorna ami g!F4E overexpressloa Ann_ 
Surg^ 227i 758-781; discussion, 761-763, 1098. 

68. Wang, a, RosenwaW, L a, Hutzler, M. Pihan, a A^ Savas, U. 
Chen, J. J., and Woda, a A. Expression of the eukaryotto translation 
WBaHon factors 4E and 2a tn non-Hoo^kto's rymphomas. Am. J. PathoL, 
155: 247-^55. 1999. 

69. FukucW-Shlmogort, T.. bhB, L, Kashhvagl, tC, Mashiba, a, Ddrnofc>. 
H„ and Igarashl, K. MaHgnam trareformatton by overproduction of trene>- 
lation birnatton factor elF4a Cancer Res., 57: 5041-6044, 1997. 

70. Bberte, J., KrasagaWs, K, and Orfanos, C. E Translation mttiatkm 
tactor elF-4A1 mRNA is consfstenby overexpressed In human melanoma 
ceBs to vftro. Int J. Cancer, 71: 398-401, 1997. 

71. Shuda, M^ Kondoh, M, Tanaka, K., Ryo, A, Waxatsukt T^ Hada, A., 
GoseW, N, rgarf, T., Hatsuse, IC, Aihara, T n HorfucH a, SWchfta. M.. 
YamamotDv M, and Yeiriamoto, M. Enhanced expression of translation tactor 
rnRNAahhqpatoceQ^ 200O. 

72. Nupponen, M M, Porkka, K.. Kakkota, U Tanner, M, Persson, 
Borg, A., Isota, «t, end Vteakorpi, T. Amptmcabon and overexpresslon of 
p40 subunrt of eukaryotic translation inrBatfon factor 3 tn breast and 
prostate cancer. Am, J. Pathol, 154: 1777-1783,1999. 

73. Rethe, M., Ko, Albors. P„ and Wernert, N. EukaryoHo mKiatton 
factor 3 p110 mRNA Is overexpressed In testicular seminomas. Am. *J. 
Pathol, 157; 1597-1604, 2O0O. . 

74. Bartund. M., Fcrozan. F„ Kononen, J. ( Bubendorf, U Chen, Y„ Bltt- 
ner, M. L, Torborst J., Haas, P., Bucher, a, Sauter. Q., KalDonieml, a P., 
and KeWonleml. A. Detecting activation of ribosomal protein S6 kinase by 
complementary DNA and tissue mk^oarray analysis. J. Natl Cancer Inst 
(Bethesda), 92: 1252-1259, 2000. 

75. TopaHan, S. L.Kaneko, S., Gonzales, M. L, Bond, G. L, Ward, Y. f and 
Manley, X U IdentJflcation and functional characterization of nec-pofy(A) 
polymerase, an RNA processing enzyme overexpressed In human tumors. 
Mol Cefl. BroL. 21: 5614-5623, 2001. 

76. SconTas, A^ TaJIerl, M., Ardavanls, A, Courtis, N., Omttrtacfo, E> Yotfe, J., 
TsJapans, C M, and Trangas, T. Poryadenylate pctymerase enzymatic ac- 
tMty \n mammary tumor cytosofe: a new Independent prognostic marker in 
primary breast cancer. Cancer Res., 60: 5427-5433, 2000. 

77. Janz, M., Harbeck, N., Dettmar, P., Berger, U., Schmkft, A, Jurchott, K., 
Schmitt, M., and Royer T K. D. Y-4jox factor YB-1 preoTctsdrug resistance and 
patient outcome In breast cancer Independent of ethically relevant tumor 
biologic factors HER2, uPA and PAI-1 . Int J. Cancer, 97: 278-282, 2002, 

78. Shibahara, K-. Suglo, K., OsaW. T., Ucfrluml, T., Maebara, Y., Kohno. 
K., Yasumoto, K., Sugbnacrd, K., and Kuwano, M. Nuclear expression of 
the Y-box binding protein, YB-1 f as a novel marker of disease r^rogresslon 
in non-small cell lung cancer. CSn. Cancer Res., 7: 3151-3155, 2001. 

79. Kamura, T., Yahata. H., Amada, a, Ogawa, S., Sonoda. T., Kober- 
yashi, H., Mteumoto, M., Kohno, K, Kuwano, M., and Nakano, a ts 
nuclear expression of Y box-binding protem-1 a new prognostic factor in 
ovarian serous adenocarcinoma? Cancer (Phi la.), 85; 2450-2454, 1999. 

80. Bargou. R. a, Jurchott. IC, Wagener, C„ Bergmann, S.. Metzner, S„ 
Bommert, K., Mapara, M. Y., Winzer, K. J., Dfetel, M.. Dorken, B., and 
Roy or, H. D. Nuclear localization and Increased levels of transcription 
factor YB-1 tn primary human breast cancers are associated with irrtrtnsJc 
MOW gene expression. Nat Med., fit 447-450, 1897. 

81. Aokt M.. Blazek, E. and Vogt P. K. A role of the kinase mTOR in 
ceDutar transformation Induced by the oncoproteins P3k and Akt Proa 
Natl. Acad. Sol. USA, 96: 136-141, 2001. 

82. Ch0d,S.J.,M10er,M.K.,andGebaite ? A.P. C^n typ^epemlem and 
-Independent control of HER-2/neu translation. Int J. Btochem. Cefl BloL, 
37:201-213,1999. 



83. Jefteries, a a. Relnhard, C Kozma, 8. C and Thomas. 0. Rapa- 
mydn selectively represses translation of the •polypyrtrnkftw tract" 
mRNAfamHy, Proa Natt Acad. Set USA, 9f: 4441-4446, 1994. 

84. Terada, fi, Petal, K a, Tajtase, K., Kohno, 1C, Nairn, A. C and 
Qetfend.E W. Rapamydn selectively Inhibits translation of mRNAs en- 
coding elongation factors and ribosomal proteins. Proc. Natl Acad. ScL 
USA, 81: 11477-11481, 1994. 

85. Jeflerles, H. B., FumagaM, 3., Dermis, P. B„ Relnhard, C, Pearson, 
ft B„ and Thomas, a Ftepamydn suppresses STOP mRNA translation 
through inhibition of p70s6Jc &1B0 J., 16: 3693-3704, 1997. 

88. Beretta, L, Glngras, A. C, Svftkin, Y. V., HaS, M. N., and Sonenberg, 
N. Rapamydn blocks the phosphorylation of 4E-BP1 and inhibits cap- 
dependent initiation of translation. EMBO J., 15; 658-664, 1996. 

87. Hidalgo, and Rcwinsky, E. K. The rapamydn-senattive signal 
transduction pathway as a target for cancer therapy. Oncogene, 19: 
6680-6688,2000. 

88. Hoscil^,DiBIng,M.a,SI^ 

Germain, a a, Abraham, R T., and Houghton, P. J. Rapamydn causes 
poorfy reversible Inftofta 

h human rhabdomyosareoma cefe. Cancer Roo^ 69: 888-894, 1999. 

89. Huang, a, and Houghton, P. J. Resistance to rapamydn: a novel 
anticancer drag. Cancer Metastasis Rev., 2fc 69-78, 2001. 

90. Geoergar, a, Kerr, It, Tang, d B., F=ung, K. M., PcwaS, B. r Sutton, 
UM, PhBflps, P. C, and Janss, A. J. Antitumor activity of the rapamydn 
analog OCf-779 bi human primitive neuroectodermal tumor/meduDoblas- 
toma modete as single agent and In combination chernomerapy. Cancer 
Res,, $1: 1527-1632, 2001. 

91. Gibbons, J. J., Discafanl, a, Peterson, R., Hernandez, a, SkotnJdd, 
A, and Frost, P. The effect of CCI-779, a novel rnacroBde anti-tumor 
agent, on the growth of human tumor cells in vitro and in nude mouse 
xenografts tovho. Proc. Am. Assoc Cancer Res., 40: 301, 1999. . 

92. Hidalgo, VL, Rowlnsky, a, Eritehman. C, MarsnaB, B.. Mart®, a, 
Edwards, T., and Buckner, J. J. A Phase 1 and pharmacological study of 
CCI-779 cycte Inhibitor. Ana Oncot, 11 (SuppL 4); 133, 2001. 

93. Alexandre, J., Raymond, E,. Dopenbrock, H», Makhaldl, S„ Angevin, 
E-t Pafflet, C, Hanauske, A., Prison, J., Feussner, A-. and Amiand, J. P. 
CCI-779, a new rapamydn analog, has antitumor activity at doses Induc- 
ing orrfy mfld cutaneous effects and mucositis: earty results of an ongoing 
Phase I study. Proceedings of the 1 999 AACR-N d-EOFTTC imernatfonaJ 
Conference, Clin. Cancer Res., 5 (Suppl.): 3730s, 1999. 

94. Chan. a, Johnston, S^Scheulen, M. E, Mross, It, Morant, A^ Lehr, 
A* Feussner, A, Berger, M„ and Wrech, T. Rrat report a Phase 2 study 
of the safety and activity of CCI-779 for patterns with locally advanced or 
metastatio breast cancer faffing prior chemotherapy. Proo. Am. Soc Cfln. 
Oncol., 21; 44a, 2002. 

95. AtWna, M. a, HWaJgo, M., Stadler, W., Logan, T., Dutcher, J. P., Hudes, 
G, Park, Y., Marshall B., Bonl, J., and Dukart, G. A randomized double-blind 
Phase 2 study of Intravenous CCI-779 administered weekly to patients with 
advanced renal ceB carcinoma. Proc. Am, See. Clin. Oncol., 21: 10a, 2002. 

96. Smith, S, a, Trlnh, C. M. ( Inge, U J., Thomas, Q., Cloughsey, T. F., 
Sawyers, C. U and MIschel, P. S. PTE*! expression status predicts 
glioblastoma cell sensitivity to CCI-779. Proc. Am. Assoc. Cancer Res., 
43: 339, 2002. 

97. Yu, It, Toral-Barza, L, Discafanl, Q„ Zhang, W. G, Skotnldd, J., 
Frost, P., and Gibbons, J. J. mTOR, a novel target In breast cancer, the 
effect of CCI-779, an mTOR Inhibitor, In predinical models of breast 
cancer. Endocr.Relat Cancer, & 249-258, 2001. . 

98. DItling, M. a, Germain, G. a, Dudkln, L, Jayaraman, A. L, Zhang. >t, 
Harwood, F. C, and Houghton, P.J. 4&bind"ng proteins, thesuppressora of 
eukaryotio Wtfaiton lector 4E, are downregulated in cete wfth acquired or 
hTtrtnslo resistance to rapamyctn. J. BloL Cbenv, 277: 13907-13917, 2002. 

99. Guba, M, von Breftenbuch, P., Stelnbauer, M., KoeW, G., Hegel, S., 
Homung, M„ Buns, C. J., Zuelke, a. Farkas, S., Anthuber, M„ Jauch, 
K. W., and Getsster. E. ft Rapamydn Inhibits primary and metastatio 
tumor growth by antlanojogenesls: Involvement of vascular endothelial 
growth factor. Nat Med., a* 128-136, 2002. 



100. Lane, H. A., Schafi, a, Theuer, A., CyReffly. T„ and Wood, J. Anti- 
angiogenic activity of RAD001, an oraBy active anticancer agent Proc. 
Am. Assoc Cancer Res., 43: 184, 2002* 

101. Maeshlma, Y., Sudhakar, A^ Lively, J. C, UeW, IC, Kharbanda, 3., 
Kahn, C. R, Sonenberg, N., Hynes, a O., and Katturt, a Tumstatin. axi 
endothelial cell- specific Inhibitor of protein synthesis. Science (Wash. DC) . 
29& 140-143, 2002. 

102. Caygni, C. P., Chartett, A., and Hill, M. J. Fat, fob, fish oH and cancor. 
Br. J. Cancer. 74: 159-164, 1996, 

103. Falc0f>er,J.S„Ros*,J.A,F6an^ 

M. G., and Carter, D. C. Effect of eJcosapentaenrto acid and other fatty 
adds on the growth in vitro of human pancreatic cancer cefl lines, Br. J* 
Cancer, 69: 826-832. 1994. 

104. Noguchl, M., MlnamU ivt, Yagasaid, a, KbioshUa. K., Earashi. M., 
Kitagawa, H. ? Taruya, T M and MlyazaW, t Ctiemopreverrbon of DMBA- 
induced mammary carcmogenesis In rats by low-dose EPA and DHA. 
Br. J. Cancer, 75: 348-353, 1997. 

105. PaiakurtW, S. FtucWger, a, Aktas, H., ChangoUtar^ A. K., Shah- 
safaef. A., HameJt, S., KUlc, &, and Halperin, J. A. inhibition of translation 
Initiation mediates the anticancer effect of the n-3 poiyunsaturated fatty 
acid efcosapentaertoic add. Cancer Res^ GO: 2919-2925, 200a 

106. Aktas, H., Rucidger, Acosta, J. A, Savage, J. M., Pafakurthi, a S., 
and Kdpen^X A. Depletion of in^ 

e4F2a^ and sustained tnHbffionoftiansiatton 

effects of ciolrtmazDte. Proa Mali Acad. Sd USA. 95: B280- S285, 1998. 

107. MhasWIkar, A. M., Schrock, a D., Hindi. M., Uao. Sieger,' K,, 
Kourouma. F„ Zou-Yang, X Onlshi, E, Takh, O.. Vedvtek, T. 3.. 
Fanger, G n Stewart, L, Watson, Q. J., Snary, D, Fisher, P. a, Saeki, T., 
Rom, J. A., Ramesh, a, and Chada, a Melanoma differentiation associ- 
ated gene-7 (mde-7J: a novel antHumor gene for cancer gene therapy. 
Mot Med., 7: 271-282. 2001. 

108. Su, Z. Z., Madlreddi, M. T„ Un. J. J., Young, C. a, Kftada. a, Reed, 
J. C«, Goldstein, N. L, and Fisher, P. & The cancer growth suppressor 
gene rmfe-7 selectively Induces apoptosis In human breast cancer cells . 
and Inhibits tumor growth in nude mice. Proc. Natt Acad. ScL USA, 95: 
1440O-14405, 1998. 

109. Saeki, T., Mhashilkar; A^ Chada, S., Branch, C. Roth, J. A., and 
Ramesh, R. Tumor-suppressfve effects by adenovlrua-medtated mda-7 
gene transfer n non-small ceil lung cancer cefl In vitro. Gene Ther,, 7: 
2051^057,2000. 

110. Pataer, A* Vorburger, a A., Barber, a N. ( Chada. a. Mhashilkar, 
A. Zou-Yang, H^ Stewart, A. L, Balachandran, S., Roth, J. A., Hunt, 
K. It, and Swisher, a a Adenoviral transfer of the melanoma different! 
atlon-essoclated gene 7 (mda7) Induces apoptoBls of lung cancer cells via 
up-regulation of the double-stranded RNA-dependent protein kinase 
(PKR). Cancer Res,, 62: 2239-2243, 2002. 

111. Ho, T., Wamken, S. P., and May, W. S. Protein synthesis Inhibition by 
ftavondds: roles of eukaryotte mitiation factor 2a kinases. Btochem. Bto- 
phys. Res. Commuru, 255: 689-594, 1999. 

112. Eberte, J^ Fecker, L F„ Bittner, J. U., Orfanos, C. E, and GeOen, C. C. 
Decreased proliferation of human melanoma cell lines caused by antisense 
RNA against translation factor elF-4A1 . Br. J. Cancer, 8Sr 1957-1962, 2002. 

1 1 3. Polunovsky, V. A., Glngras, A. C^ Sonenberg, N., Peterson, M., Tan, A, 
Rubins, J. a, ManIvoI,J.C^andBmennan, P. B. TraroIatkjnaJ control of the 
antlapoptotic function of Ras. J. BloL Chem., 275: 2477B-24780, 2000. 

114. D'Cunha, J„ Kratzke, M. O, Alter, M D M Polunovsky, V. A, BIttsrman, 
P. B., and Kratzke, a A. Over-expresskm of the translations! repressor 
4E-BP1 Inhibits NSCLC tumongOTidty In vivo. Proc Am. Assoc. Cancer Res., 
43:818-817,2002. 

1 1 5. OeFatta, R. J„ Nathan, C. A., and De Benedetto*, A. Antisense RNA to 
elF4E suppresses oncogenic properties of a head and neck squamous 
cell carcinoma cell Una Laryngoscope, 1 10: 028-933. 2000. 

116. Herbert, T. P., Fahraeus, a, Presoott. A^ Una, D. P., and Proud. 
C. a Rapid induction of apoptosis mediated by peptides that bind Initi- 
ation factor elF4E. Cum Biol., 10: 793-798, 2000. 

117. DeFetta, R. J M U, Y„ and De Benedettl, A. Seiecttve killing of cancer 
cells based on translatlonal control of a suldde gene. Cancer GeneTher., 
9: $73-678, 2002. 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 

! 



